LINEAR 
ALGEBRA 
Theorems and _— 


SS yy, 
= SS ae 
- 
= 
a 


. aS er = 
_~S 


= = 


x S@ 
ws 


LINEAR ALGEBRA — 
THEOREMS AND 
APPLICATIONS 


Edited by Hassan Abid Yasser 


Linear Algebra —- Theorems and Applications 
http://dx.doi.org/10.5772/3107 
Edited by Hassan Abid Yasser 


Contributors 

Matsuo Sato, Francesco Aldo Costabile, Elisabetta Longo, A. Amparan, S. Marcaida, |. Zaballa, 
Taro Kimura, Ricardo L. Soto, Daniel N. Miller, Raymond A. de Callafon, P. Cervantes, L.F. 
Gonzalez, F.J. Ortiz, A.D. Garcia, Pattrawut Chansangiam, Jadranka Mici¢, Josip Pedari¢, 
Abdulhadi Aminu, Mohammad Poursina, Imad M. Khan, Kurt S. Anderson 


Published by InTech 
Janeza Trdine 9, 51000 Rijeka, Croatia 


Copyright © 2012 InTech 

All chapters are Open Access distributed under the Creative Commons Attribution 3.0 license, 
which allows users to download, copy and build upon published articles even for commercial 
purposes, as long as the author and publisher are properly credited, which ensures maximum 
dissemination and a wider impact of our publications. After this work has been published by 
InTech, authors have the right to republish it, in whole or part, in any publication of which they 
are the author, and to make other personal use of the work. Any republication, referencing or 
personal use of the work must explicitly identify the original source. 


Notice 

Statements and opinions expressed in the chapters are these of the individual contributors and 
not necessarily those of the editors or publisher. No responsibility is accepted for the accuracy 
of information contained in the published chapters. The publisher assumes no responsibility for 
any damage or injury to persons or property arising out of the use of any materials, 
instructions, methods or ideas contained in the book. 


Publishing Process Manager Marijan Polic 
Typesetting InTech Prepress, Novi Sad 
Cover InTech Design Team 


First published July, 2012 
Printed in Croatia 


A free online edition of this book is available at www.intechopen.com 
Additional hard copies can be obtained from orders@intechopen.com 


Linear Algebra — Theorems and Applications, Edited by Hassan Abid Yasser 
p. cm. 
ISBN 978-953-51-0669-2 


Contents 


Chapter 1 


Chapter 2 


Chapter 3 


Chap 


Chap 


Chap 


er 4 


er 5 


er6 


Chapter 7 


Chapter 8 


Chapter 9 


Preface IX 


3-Algebras in String Theory 1 
Matsuo Sato 


Algebraic Theory of Appell Polynomials 
with Application to General Linear 
Interpolation Problem 21 

Francesco Aldo Costabile and Elisabetta Longo 


An Interpretation of Rosenbrock’s Theorem 
via Local Rings 47 
A. Amparan, S. Marcaida and |. Zaballa 


Gauge Theory, Combinatorics, and Matrix Models 
Taro Kimura 


Nonnegative Inverse Eigenvalue Problem 99 
Ricardo L. Soto 


Identification of Linear, 
Discrete-Time Filters via Realization 117 
Daniel N. Miller and Raymond A. de Callafon 


Partition-Matrix Theory Applied 

to the Computation of Generalized-Inverses 

for MIMO Systems in Rayleigh Fading Channels 
P. Cervantes, L.F. Gonzalez, F.J. Ortiz and A.D. Garcia 


Operator Means and Applications 163 
Pattrawut Chansangiam 


Recent Research 
on Jensen’s Inequality for Operators 189 
Jadranka Mici¢ and Josip Peéari¢ 


75 


137 


VI Contents 


Chapter 10 A Linear System of Both Equations 
and Inequalities in Max-Algebra 215 
Abdulhadi Aminu 


Chapter 11 Efficient Model Transition in Adaptive Multi-Resolution 
Modeling of Biopolymers 237 
Mohammad Poursina, Imad M. Khan and Kurt S. Anderson 


Preface 


The core of linear algebra is essential to every mathematician, and we not only treat 
this core, but add material that is essential to mathematicians in specific fields. This 
book is for advanced researchers. We presume you are already familiar with 
elementary linear algebra and that you know how to multiply matrices, solve linear 
systems, etc. We do not treat elementary material here, though we occasionally return 
to elementary material from a more advanced standpoint to show you what it really 
means. We have written a book that we hope will be broadly useful. In a few places 
we have succumbed to temptation and included material that is not quite so well 
known, but which, in our opinion, should be. We hope that you will be enlightened 
not only by the specific material in the book but also by its style of argument. We also 
hope this book will serve as a valuable reference throughout your mathematical 
career. 


Chapter 1 reviews the metric Hermitian 3-algebra, which has been playing important 
roles recently in sting theory. It is classified by using a correspondence to a class of the 
super Lie algebra. It also reviews the Lie and Hermitian 3-algebra models of M-theory. 
Chapter 2 deals with algebraic analysis of Appell polynomials. It presents the 
determinantal approaches of Appell polynomials and the related topics, where many 
classical and non-classical examples are presented. Chapter 3 reviews a universal 
relation between combinatorics and the matrix model, and discusses its relation to the 
gauge theory. Chapter 4 covers the nonnegative matrices that have been a source of 
interesting and challenging mathematical problems. They arise in many applications 
such as: communications systems, biological systems, economics, ecology, computer 
sciences, machine learning, and many other engineering systems. Chapter 5 presents 
the central theory behind realization-based system identification and connects the 
theory to many tools in linear algebra, including the QR-decomposition, the singular 
value decomposition, and linear least-squares problems. Chapter 6 presents a novel 
iterative-recursive algorithm for computing GI for block matrices in the context of 
wireless MIMO communication systems within RFC. Chapter 7 deals with the 
development of the theory of operator means. It setups basic notations and states some 
background about operator monotone functions which play important roles in the 
theory of operator means. Chapter 8 studies a general formulation of Jensen’s operator 
inequality for a continuous field of self-adjoint operators and a field of positive linear 


Preface 


mappings. The aim of chapter 9 is to present a system of linear equation and 
inequalities in max-algebra. Max-algebra is an analogue of linear algebra developed on 
a pair of operations extended to matrices and vectors. Chapter 10 covers an efficient 
algorithm for the coarse to fine scale transition in multi-flexible-body systems with 
application to biomolecular systems that are modeled as articulated bodies and 
undergo discontinuous changes in the model definition. Finally, chapter 11 studies the 
structure of matrices defined over arbitrary fields whose elements are rational 
functions with no poles at infinity and prescribed finite poles. Complete systems of 
invariants are provided for each one of these equivalence relations and the 
relationship between both systems of invariants is clarified. This result can be seen as 
an extension of the classical theorem on pole assignment by Rosenbrock. 


Dr. Hassan Abid Yasser 
College of Science 

University of Thi-Qar, Thi-Qar 
Iraq 


Chapter 1 


3-Algebras in String Theory 


Matsuo Sato 


Additional information is available at the end of the chapter 


http://dx.doi.org/10.5772/46480 


1. Introduction 


In this chapter, we review 3-algebras that appear as fundamental properties of string theory. 
3-algebra is a generalization of Lie algebra; it is defined by a tri-linear bracket instead of 
by a bi-linear bracket, and satisfies fundamental identity, which is a generalization of Jacobi 
identity [1-3]. We consider 3-algebras equipped with invariant metrics in order to apply them 
to physics. 


It has been expected that there exists M-theory, which unifies string theories. In M-theory, 
some structures of 3-algebras were found recently. First, it was found that by using u(N) © 
u(N) Hermitian 3-algebra, we can describe a low energy effective action of N coincident 
supermembranes [4-8], which are fundamental objects in M-theory. 


With this as motivation, 3-algebras with invariant metrics were classified [9-22]. Lie 3-algebras 
are defined in real vector spaces and tri-linear brackets of them are totally anti-symmetric in 
all the three entries. Lie 3-algebras with invariant metrics are classified into Ay algebra, and 
Lorentzian Lie 3-algebras, which have metrics with indefinite signatures. On the other hand, 
Hermitian 3-algebras are defined in Hermitian vector spaces and their tri-linear brackets are 
complex linear and anti-symmetric in the first two entries, whereas complex anti-linear in the 
third entry. Hermitian 3-algebras with invariant metrics are classified into u(N) © u(M) and 
sp(2N) © u(1) Hermitian 3-algebras. 


Moreover, recent studies have indicated that there also exist structures of 3-algebras in 
the Green-Schwartz supermembrane action, which defines full perturbative dynamics of a 
supermembrane. It had not been clear whether the total supermembrane action including 
fermions has structures of 3-algebras, whereas the bosonic part of the action can be described 
by using a tri-linear bracket, called Nambu bracket [23, 24], which is a generalization of 
Poisson bracket. If we fix to a light-cone gauge, the total action can be described by using 
Poisson bracket, that is, only structures of Lie algebra are left in this gauge [25]. However, it 
was shown under an approximation that the total action can be described by Nambu bracket 
if we fix to a semi-light-cone gauge [26]. In this gauge, the eleven dimensional space-time 
of M-theory is manifest in the supermembrane action, whereas only ten dimensional part is 
manifest in the light-cone gauge. 
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The BFSS matrix theory is conjectured to describe an infinite momentum frame (IMF) limit 
of M-theory [27] and many evidences were found. The action of the BFSS matrix theory can 
be obtained by replacing Poisson bracket with a finite dimensional Lie algebra’s bracket in 
the supermembrane action in the light-cone gauge. Because of this structure, only variables 
that represent the ten dimensional part of the eleven-dimensional space-time are manifest in 
the BFSS matrix theory. Recently, 3-algebra models of M-theory were proposed [26, 28, 29], 
by replacing Nambu bracket with finite dimensional 3-algebras’ brackets in an action that is 
shown, by using an approximation, to be equivalent to the semi-light-cone supermembrane 
action. All the variables that represent the eleven dimensional space-time are manifest in these 
models. It was shown that if the DLCQ limit of the 3-algebra models of M-theory is taken, they 
reduce to the BFSS matrix theory [26, 28], as they should [30-35]. 


2. Definition and classification of metric Hermitian 3-algebra 


In this section, we will define and classify the Hermitian 3-algebras equipped with invariant 
metrics. 


2.1. General structure of metric Hermitian 3-algebra 


The metric Hermitian 3-algebra is a map V x V x V — V defined by (x,y,z) > [x,y;z], where 
the 3-bracket is complex linear in the first two entries, whereas complex anti-linear in the last 
entry, equipped with a metric < x,y >, satisfying the following properties: 

the fundamental identity 


[[x,yrz], 070] = [[x, 0; w], ysz] + |x, [y, ow] 2] — [x,y [2,07 0]] (1) 


the metric invariance 


< [x,v;w],y >— < x,[y,w;v] >=0 (2) 
and the anti-symmetry 
[x,y;2z] = —[y, x;2] (3) 
for 
X,Y,z,0,w EV (4) 


The Hermitian 3-algebra generates a symmetry, whose generators D(x, y) are defined by 


D(x,y)z = [z, x;y] (5) 
From (1), one can show that D(x, y) form a Lie algebra, 


[D(x,y), D(v, w)] = D(D(x, y)v, w) — D(z, Diy, x)w) (6) 


There is an one-to-one correspondence between the metric Hermitian 3-algebra and a class of 
metric complex super Lie algebras [19]. Such a class satisfies the following conditions among 
complex super Lie algebras S = So © 51, where Sg and 5S; are even and odd parts, respectively. 
S; is decomposed as S; = V @ V, where V is an unitary representation of Sp: for a € So, 
u,vEeV, 

[au] eV (7) 
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and 
< [a,u],0 >+ <u, [a*,v] >=0 (8) 
a € Vis defined by 
G=< ,v> (9) 
The super Lie bracket satisfies 
[V,V]=0, [V,V] =0 (10) 


From the metric Hermitian 3-algebra, we obtain the class of the metric complex super Lie 
algebra in the following way. The elements in So, V, and V are defined by (5), (4), and (9), 
respectively. The algebra is defined by (6) and 


[D(x,y),z] = D(x, y)z = [z, x;y] 

[D(x,y),2] = —D(y,x)z = —[z,y;] 

[x,9] = D(x,y) 

[x,y] :=0 

[z,7] = 0 (11) 


One can show that this algebra satisfies the super Jacobi identity and (7)-(10) as in [19]. 


Inversely, from the class of the metric complex super Lie algebra, we obtain the metric 
Hermitian 3-algebra by 

[x,y;2z] = a[ly,Z], x] (12) 
where a is an arbitrary constant. One can also show that this algebra satisfies (1)-(3) for (4) as 
in [19]. 


2.2. Classification of metric Hermitian 3-algebra 


The classical Lie super algebras satisfying (7)-(10) are A(m —1,n — 1) and C(n+ 1). The even 
parts of A(m—1,n—1) and C(n +1) are u(m) @ u(n) and sp(2n) @ u(1), respectively. Because 
the metric Hermitian 3-algebra one-to-one corresponds to this class of the super Lie algebra, 
the metric Hermitian 3-algebras are classified into u(m) @ u(n) and sp(2n) 6 u(1) Hermitian 
3-algebras. 


First, we will construct the u(m) @ u(n) Hermitian 3-algebra from A(m — 1,n — 1), according 
to the relation in the previous subsection. A(m— 1,n—1) is simple and is obtained by dividing 
sl(m,n) by its ideal. That is, A(m —1,n—1) = sl(m,n) whenm ¢ nand A(n—1,n—1) = 


sl(n,n)/Algy. 
hy c 
(ia iS) Uo} 


where hy and hz are m X mand n x n anti-Hermite matrices and c is ann x m arbitrary complex 
matrix. Complex s/(m,n) is a complexification of real s!(m,n), given by 


8) a 


Real s/(m,n) is defined by 


Linear Algebra —- Theorems and Applications 


where a, B, y, and 6 arem x m,n X m,m x n,and n x n complex matrices that satisfy 
tra = trd (15) 
Complex A(m — 1,n — 1) is decomposed as A(m — 1,n—1) = So @V @ V, where 


(oye? (16) 


(9) is rewritten as V —> V defined by 


(38) (39) : 


where B € V and Bt € V. (12) is rewritten as 


er: 
[X, ¥;Z] = a{[Y, Z*], xX] =a sr v) dis 
for 
0x 
xe (22) ev 
Oy 
v=(98) ev 
0z 
z= (92) ev 
(19) 
As a result, we obtain the u(m) © u(n) Hermitian 3-algebra, 
[x, y;z] = a(yz'x — xzty) (20) 


where x, y, and z are arbitrary n x m complex matrices. This algebra was originally 
constructed in [8]. 


Inversely, from (20), we can construct complex A(m — 1,n —1). (5) is rewritten as 
D(x,y) = (xy",y*x) € So (21) 


(6) and (11) are rewritten as 


(xy y*x), (xy yxy] = ([xy?, x’y"), [y*x, y*x']) 


(xy",y*x),z] = xy"z — zy*x 
(xy",y*x),w"] =y*xw" 
x,y"] = (xy y*x) 

x,y] =0 

sy |=0 (22) 


—w'xyt 
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x T 4 x! It Zz! 
( eA vin) i Ot yw) | (23) 


which forms complex A(m—1,n— 1). 


This algebra is summarized as 


Next, we will construct the sp(2n) @ u(1) Hermitian 3-algebra from C(n + 1). Complex C(n + 
1) is decomposed as C(n +1) = Sp 6 V & V. The elements are given by 


EV 


oooo CcoooR:R 


ev (24) 


coso oo oR 
oosS co oo os 


ooo°o 


where « is a complex number, a is an arbitrary n x n complex matrix, b and c are n x n complex 


symmetric matrices, and x}, x2, y1 and y2 are n x 1 complex matrices. (9) is rewritten as V > V 
defined by B ++ B = UB*U~!, where B € V, B € Vand 


0100 
1000 
U1 90-04 G) 
00-10 
Explicitly, 
0 0 x, Xx 000 0 
{0 0 00 an 0 0x5 —x} 
Be lo gl 90.) = |= oo 0 ee) 
0-x7 0 0 -x300 0 
(12) is rewritten as 
[X, Y;Z] := af[Y, Z], X] 
00 ny 000 0 0 0 x Xx 
aa 00 00 0 Oz —z% 00 00 
_ 0y 00 ]’|-ztoo oO a ee ee) 
0-yi 00 -z500 0 0-xi 0 0 
0 0 W, W2 
00 00 
~ “lo wl 0 0 2 
0-wi 0 0 
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for 


coos cooR 
coos coos 


eV (28) 


cooo coeo Sseee 
oo 


where w, and wp are given by 
(wy, wa) = —(yrzq + y223) (x1, ¥2) + (x1z] + 2223) (Yayo) + (x27 — x192)(Z,-23) (29) 


As a result, we obtain the sp(2n) © u(1) Hermitian 3-algebra, 
[x,y;z] =a((y@©Z)x+ (ZO x)y— (x@y)zZ) (30) 
for x = (x1,x2),¥ = (Y1,Y2),Z = (Z1,Z2), where x1, Xo, Y1, Y2, 21, and Zz are n-vectors and 
z= (,-2) 
a@b=a,-bo—an-by (31) 
3. 3-algebra model of M-theory 


In this section, we review the fact that the supermembrane action in a semi-light-cone gauge 
can be described by Nambu bracket, where structures of 3-algebra are manifest. The 3-algebra 
Models of M-theory are defined based on the semi-light-cone supermembrane action. We also 
review that the models reduce to the BFSS matrix theory in the DLCQ limit. 


3.1. Supermembrane and 3-algebra model of M-theory 


The fundamental degrees of freedom in M-theory are supermembranes. The action of the 
covariant supermembrane action in M-theory [36] is given by 


Sm2 = [@o(v-G+ cePT ET nda ¥ (I1gMILN oi slg PINO,¥ 
1 apMy wan 
— SW Mag YT an ¥)) (32) 


where M,N = 0,--- ,10, a, 8, ¥ = 0,1,2, Gug = TyMT Igy and I1,@ = 0,.XM — 5¥1™0,¥. ¥ 
is a SO(1,10) Majorana fermion. 
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This action is invariant under dynamical supertransformations, 


ov =e 
5X” — —~j9TMe (33) 


These transformations form the V = 1 supersymmetry algebra in eleven dimensions, 


(51, 59] xm = —2ie Me, 


[61, oJ ¥ = 0 (34) 
The action is also invariant under the x-symmetry transformations, 


b6¥ = (1+T)x(c) 


OXM — IM (1+T)x(c) (35) 
where i 
= LAMAN 
T= ' Fee as TT imMn (36) 


If we fix the x-symmetry (35) of the action by taking a semi-light-cone gauge [26]! 
eae: A (37) 
we obtain a semi-light-cone supermembrane action, 


: 1. = 
gp YT'0,¥ — Tp Pap ¥¥T'07¥) 


+9 j3q'¥9gX!d,X! )) (38) 


Spe re Po(V—G+ zetb (80 ya (TIg'T1," + su 


where Gyg = hyp + Tq" gy, Ug" = Og X" — 5 PTH Oqg¥, and yg = qX!dgX1. 


In [26], it is shown under an approximation up to the quadratic order in dq X" and d,¥ but 
exactly in X! that this action is equivalent to the continuum action of the 3-algebra model of 
M-theory, 


1 1 
Sa = f BoJ=3(—G{X' XI XK — 5 (Apel e", 9X1}? 


Lay 1 
~ get AyavAved Ares LP", p, pe }{", p gl} . 54 


= Le 
STM Awol 9" 9 Fh + FHT {X!, XI, ¥}) (39) 
where I,J,K = 3,---,10 and {o%, 9", p°} = eB 1A, PAB p04 9° is the Nambu-Poisson 
bracket. An invariant symmetric bilinear form is defined by [ Po /—go'g? for complete 


basis p" in three dimensions. Thus, this action is manifestly VPD covariant even when the 
world-volume metric is flat. X! is a scalar and ¥ is a SO(1,2) x SO(8) Majorana-Weyl fermion 


' Advantages of a semi-light-cone gauges against a light-cone gauge are shown in [37-39] 
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satisfying (37). EPA is a Levi-Civita symbol in three dimensions and A is a cosmological 
constant. 


The continuum action of 3-algebra model of M-theory (39) is invariant under 16 dynamical 
supersymmetry transformations, 


6x =i ¥ 
i 
6A,(0,0') = pel ul (X10) ¥(0') =X'(@)¥ie)), 
1 
bY = Ay {o", 9", X' STMT ye — p(X! X!, XSW Ke (40) 
where I'p12€ = —e. These supersymmetries close into gauge transformations on-shell, 
[61,52]X! = Acal g*, 94, X"} 
61, 62) Anab{ @", 9”, } ad Aart 9", g”, Aycal 9, 9", }} 
Anat", "Meal P, 9", i + 2ié:T¥ e1O%ny 
: : i 
[51,52] ¥ = Acal 9, 94, ¥} + (ieTMeyIy, — get er KL)O* (41) 
where gauge parameters are given by Agy = 21€2T¥ ey Ayab = iT xe X) XK. OF, = 0 and 
Of =Oare equations of motions of A, and Y, respectively, where 
Of = Apart 9", 9”, Aveal 9, g", }} — Avant 9", 9”, Aycal 9 g, tae 
Lae 
+Ewi({XL Ao ge" Xp ASE Y, }) 
1 
OF = -I* Aya {9", 9", ¥} + sly{X, XI, 9} (42) 
(41) implies that a commutation relation between the dynamical supersymmetry 


transformations is 
5701 — 6157 = 0 (43) 


up to the equations of motions and the gauge transformations. 


This action is invariant under a translation, 
bx") =4', SAH (7,0) = yh (0) — (0) (44) 
where 77! are constants. 


The action is also invariant under 16 kinematical supersymmetry transformations 
bv =é (45) 


and the other fields are not transformed. é is a constant and satisfy [oi2é = €. € and € 
should come from sixteen components of thirty-two NV = 1 supersymmetry parameters in 
eleven dimensions, corresponding to eigen values +1 of T'g12, respectively. This V = 1 
supersymmetry consists of remaining 16 target-space supersymmetries and transmuted 16 
x-symmetries in the semi-light-cone gauge [25, 26, 40]. 
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A commutation relation between the kinematical supersymmetry transformations is given by 
5y61 — 5149 = 0 (46) 
A commutator of dynamical supersymmetry transformations and kinematical ones acts as 
(6961 — 5449) X! (0) = i€T/& = no 
(5951 — 5152) AM(o,0") = SerT"T1(X"(0) — X(0'))ér = ho) —mh(o’) (47) 
where the commutator that acts on the other fields vanishes. Thus, the commutation relation 
is given by : 7 
0261 — 6102 = dy (48) 
where 6, is a translation. 
If we change a basis of the supersymmetry transformations as 
b= 5+6 
5’ = i(6 — 4) (49) 
we obtain 
5564 — 6465 = oy 
8781 — 0455 = by 
54,54 — 6165 =0 (50) 
These thirty-two supersymmetry transformations are summarised as A = (0/ ,o') and (50) 
implies the N=1 supersymmetry algebra in eleven dimensions, 


AgA; — AyAg = 8 (51) 
7] 


3.2. Lie 3-algebra models of M-theory 


In this and next subsection, we perform the second quantization on the continuum action of 
the 3-algebra model of M-theory: By replacing the Nambu-Poisson bracket in the action (39) 
with brackets of finite-dimensional 3-algebras, Lie and Hermitian 3-algebras, we obtain the 
Lie and Hermitian 3-algebra models of M-theory [26, 28], respectively. In this section, we 
review the Lie 3-algebra model. 


If we replace the Nambu-Poisson bracket in the action (39) with a completely antisymmetric 
real 3-algebra’s bracket [21, 22], 
| Po/— go ( ) 


(oe OS rT | (52) 
we obtain the Lie 3-algebra model of M-theory [26, 28], 


1 1 
Sy (=X) a0 ee 5 (Apablt", TY, X1])? 
— FE Apap Avet Ares T, T, 7 re T, T!)] 


= SET" Apap [T*, T?,¥] + ay [xt x/ ¥]) (53) 
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We have deleted the cosmological constant A, which corresponds to an operator ordering 
ambiguity, as usual as in the case of other matrix models [27, 41]. 
This model can be obtained formally by a dimensional reduction of the NV = 8 BLG model 
[4-6], 
1 1 1 b d 
Syassic = f dx(— FIX XI, XK? — 5 (Dy X!? — EMA (5 Auap®v AncaT(T?, Te, Th] 


1 
a5 zAuabA ved Ares |T", i, fig as rT, T!)) 


i ig 
PS eT Dy¥ + F481 y[X!, XI, ¥]) (54) 


The formal relations between the Lie (Hermitian) 3-algebra models of M-theory and the NV = 8 
(N = 6) BLG models are analogous to the relation among the VV = 4 super Yang-Mills in four 
dimensions, the BFSS matrix theory [27], and the IIB matrix model [41]. They are completely 
different theories although they are related to each others by dimensional reductions. In the 
same way, the 3-algebra models of M-theory and the BLG models are completely different 
theories. 


The fields in the action (53) are spanned by the Lie 3-algebra T” as xi = x! oo = FT" 
and A! = AL ® T°, where I = 3,--- ,10 and py = 0,1,2. <> represents a metric for the 
3-algebra. Y is a Majorana spinor of SO(1,10) that satisfies [oj = Y. EA is a Levi-Civita 
symbol in three-dimensions. 


Finite dimensional Lie 3-algebras with an invariant metric is classified into four-dimensional 
Euclidean A, algebra and the Lie 3-algebras with indefinite metrics in [9-11, 21, 22]. We do 
not choose A4 algebra because its degrees of freedom are just four. We need an algebra with 
arbitrary dimensions N, which is taken to infinity to define M-theory. Here we choose the 
most simple indefinite metric Lie 3-algebra, so called the Lorentzian Lie 3-algebra associated 
with u(N) Lie algebra, 


(rhe r|=0 
ee ee ae ee 
[TTT] = [TT] =f ae 
[ri 7,7] = fr (65) 
where a = —1,0,i (i = 1,--- , N?). T’ are generators of u(N). A metric is defined by a 
symmetric bilinear form, 
a) a ie ee | (56) 
<T1,Ti>=hi (57) 


and the other components are 0. The action is decomposed as 


1 1 1 1 
o= Te(=3 (eg) [et 2! P a 5 (xo[x1,x/])° = 5 (xobu + [@urx'])? — se by [av,a,] 


+iFol" uh — 5G" lay,¥] + 5x80 le, 9] — SGOT ylewI]p) 68) 
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where we have renamed x} > oe bev > x1, ¥%o > Yo, ¥,Ti > Y, 2A wit’ + ay, and 
A uij [T’, T/] > by. ay correspond to the target coordinate matrices X", whereas by are auxiliary 
fields. 


In this action, T~! mode; X! y, ¥_1 or At 4q does not appear, that is they are unphysical 
modes. Therefore, the indefinite part of the metric (56) does not exist in the action and the Lie 
3-algebra model of M-theory is ghost-free like a model in [42]. This action can be obtained 
by a dimensional reduction of the three-dimensional NV = 8 BLG model [4-6] with the same 
3-algebra. The BLG model possesses a ghost mode because of its kinetic terms with indefinite 
signature. On the other hand, the Lie 3-algebra model of M-theory does not possess a kinetic 
term because it is defined as a zero-dimensional field theory like the IIB matrix model [41]. 


This action is invariant under the translation 

éxi = 7, dat = yh (59) 
where 7/ and y belong to u(1). This implies that eigen values of x! and a! represent an 
eleven-dimensional space-time. 


The action is also invariant under 16 kinematical supersymmetry transformations 
dp=ée (60) 


and the other fields are not transformed. € belong to u(1) and satisfy Tpj.é = €. € and € 
should come from sixteen components of thirty-two NV = 1 supersymmetry parameters in 
eleven dimensions, corresponding to eigen values +1 of I'o12, respectively, as in the previous 
subsection. 


A commutation relation between the kinematical supersymmetry transformations is given by 
5761 — 64d) = 0 (61) 
The action is invariant under 16 dynamical supersymmetry transformations, 
dX! = ial ¥ 
bAyay(T7,T’, ] = ie yl [X,¥, |] 
1 
OY = —Ayap[T*, T?, X'|0"T ye — glx", XI, X®\Pyxe (62) 
where I'pj€ = —e. These supersymmetries close into gauge transformations on-shell, 
(51, 62] X! = Agg[T*, T?, X"] 
[51,62] Anab [Ee ; vue ] = Mab ae i ba Aucd (T°, 7: ]] 


—Ayab(T*, 7’, AcalT%,T4, J] + 2i@2T¥e1Ofy 


[51,52] ¥ = Agg[T®, T4, ¥] + (eT Me Ty — geet esx )O* (63) 
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where gauge parameters are given by Ajy = 2i€2I"€)Ayqy — iT jee X)XK. OF = 0 and 
OF = Oare equations of motions of A,, and Y, respectively, where 
CH A2ae Aa. lAal Ag. | 
$e Agi), ery, 2) 
OF = -I¥Aygp[T*, T?, ¥] + sh X!,xI,¥] (64) 


(63) implies that a commutation relation between the dynamical supersymmetry 
transformations is 
5761 — 6157 = 0 (65) 
up to the equations of motions and the gauge transformations. 
The 16 dynamical supersymmetry transformations (62) are decomposed as 
éx! = ial yp 
dx), = iT 


big =7eF hy 


op = —(by xh + [ay,x']) TPT ye — ste x8 Pyxe 
do = 0 

1 
bp_y = —Tr(byx! TMT ye — g(x! x! )x4 PiyKe 


bay = iT I 1 (xgp a pox") 
bby = ie pT [x", Y] 


- 1 
SA ai = EET 15 (x4 fi — P-14}) 


= 1 
BAy—10 = HT pT 15 (x14 fo — P19) (66) 
and thus a commutator of dynamical supersymmetry transformations and kinematical ones 
acts as 
(396, = 5469) x! = i€\Tl@ = yi 
(Con -_ 6169) a" _ i€ TT xh é> = qt 
~ s a 3 
(0201 — 559) A" | .T' = sierT MT px! 12 (67) 
where the commutator that acts on the other fields vanishes. Thus, the commutation relation 
for physical modes is given by : ’ 
0261 — 6102 = dy (68) 
where 6, is a translation. 


(61), (65), and (68) imply the VV = 1 supersymmetry algebra in eleven dimensions as in the 
previous subsection. 
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3.3. Hermitian 3-algebra model of M-theory 


In this subsection, we study the Hermitian 3-algebra models of M-theory [26]. Especially, we 
study mostly the model with the u(N) @ u(N) Hermitian 3-algebra (20). 


The continuum action (39) can be rewritten by using the triality of SO(8) and the SU(4) x U(1) 
decomposition [8, 43, 44] as 


ei [ ®oV=e(-V - AyalZ4,T TAL Zab, Tey 


1 
+3 EM Auta AvdcAagelT, cae sige Waar ee, ig 


+if4T* Anpa{pa,T*, T?} + Banco ¥4{Z°, 2°, p®} = SEBO ZL Fa, pp, Zc} 
—ip“{pa,Z?, Zp} + 2iPA{ pp, 2°, Z,}) (69) 


where fields with a raised A index transform in the 4 of SU(4), whereas those with lowered 
one transform in the 4. A uba (# = 0,1,2) is an anti-Hermitian gauge field, ZA and Z, area 
complex scalar field and its complex conjugate, respectively. yy, is a fermion field that satisfies 


Tb 4 = —Pa (70) 


and yA is its complex conjugate. EPA and E4BCD are Levi-Civita symbols in three dimensions 
and four dimensions, respectively. The potential terms are given by 


2. CcDyB 
V = SYP Yép 


1 1 
Vp” = {2°,2?, Zp} — 565{Z",Z", Ze} + sop {Z",Z°, Ze} (71) 


If we replace the Nambu-Poisson bracket with a Hermitian 3-algebra’s bracket [19, 20], 

/ Bo./— go ( ) 

{9% 9°, 9°} > [T. T°; T (72) 
we obtain the Hermitian 3-algebra model of M-theory [26], 


ab md 1 en (Serpe Deerarey Teor 
S= (-v = Aiba [Z, TS. T) Ab [Za,T¢; T4] + gE Aig AyacAagell™, T°; T (T?, Th. T?] 


2s op i = = i Le 7 
+iP “THA, bg [Wa, hag T”] — 5 Esco} ze Zh p”] = se ea Pa, YB; Zc] 
— if (pa, 2°; Zp] + 219 (pp, 2°; Z4]) (73) 


where the cosmological constant has been deleted for the same reason as before. The potential 
terms are given by 
2. cDvB 
V=3%8 Yep 


YG? = [Z°, 2°; Z5] — ara Z?; Ze] + sR bag age A (74) 
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This matrix model can be obtained formally by a dimensional reduction of the VW = 6 BLG 
action [8], which is equivalent to ABJ(M) action [7, 45/7, 
— 1 -7 ae 
SN=6BLG = [@x(-v = Dy ZO DP Za, + EM (5 AycbvA gat [T", eT 
1 =: ia. a ee 
1 3A ubaAvdeAr fell, TGP I, TS; T?)) 
ss i - i a - 
iP T!Dytpa + 5 Easco# [2° 29; 9] — 5B“? ? Zyl Ba, bp: Zc] 
—if*[pa, 2°; Zp] + 214 (ps, Z°;Za]) (75) 
The Hermitian 3-algebra models of M-theory are classified into the models with u(m) © u(n) 
Hermitian 3-algebra (20) and sp(2n) © u(1) Hermitian 3-algebra (30). In the following, 


we study the u(N) @ u(N) Hermitian 3-algebra model. By substituting the u(N) @ u(N) 
Hermitian 3-algebra (20) to the action (73), we obtain 


$= Tr( Gry v—(z4ak ALZAY(ZA ARH Aluzayt_ ET pwr ab ab at — Ab al at) 
— PT" (paAp — Apa) + =F iB asco 4Z°ptZ — iEABCD Zi pt Zip 
=p py Zhe + pA 287i ha + 2rb ge 2" = 2i ZZ pp)) (76) 
where AY = — FIA, 5, TPT" and Ali = — BIA, 5 TT? are N x N Hermitian matrices. In 


the algebra, we have set « = ae where k is an integer representing the Chern-Simons level. 
We choose k = 1 in order to obtain 16 dynamical supersymmetries. V is given by 


v= +g ZhzAzhzPztz0 + sha zi ah zz. + aA ZPZZAZE Ze 
SE ee ae aoe ee (77) 
By redefining fields as 
3 
ZA (=) i 
270 
AE 2n\5 Al! 
k 
a_, {5 * A (78) 
p re ees 


we obtain an action that is independent of Chern-Simons level: 
= Tr(-V ata aA Za aaa a EMV ARATAR — ALATA) 
— PT (pa Ag — Ala) + iEspcph4Z°y'8Z? — EAB Zh yt Zbbp 
~iG paZhZ® + iG Z?Zhwa + 2ifp_ZhZ" — 21642" Zh pp) (79) 


? The authors of [46-49] studied matrix models that can be obtained by a dimensional reduction of the ABJM and ABJ 
gauge theories on $3. They showed that the models reproduce the original gauge theories on $? in planar limits. 
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as opposed to three-dimensional Chern-Simons actions. 


If we rewrite the gauge fields in the action as Al = A, +b, and At = A, — by, we obtain 


ree ee 
S= Tr(-V at ((Au, Z*) a5 {Dy, Z4})((A", Za] _ {er Za}) TEM (= bybvba 2A, Avby) 
+pT" ((Ay, pal + (by, Pa}) + iEsscph4 Zo pt Z? — EAB Zh pt , Zh yp 
— ip prZ5Z? + ip ZZ pa + 2p paZ4Z® — 2ip4Z9Z5 pp) (80) 


where [, | and { , } are the ordinary commutator and anticommutator, respectively. The 
u(1) parts of A” decouple because A” appear only in commutators in the action. b! can be 
regarded as auxiliary fields, and thus A” correspond to matrices X" that represents three 
space-time coordinates in M-theory. Among N x N arbitrary complex matrices ZA, we need 
to identify matrices X! (I = 3,---10) representing the other space coordinates in M-theory, 
because the model possesses not SO(8) but SU(4) x U(1) symmetry. Our identification is 


x! = X! inl (81) 


where X! and x! are su(N) Hermitian matrices and real scalars, respectively. This is analogous 
to the identification when we compactify ABJM action, which describes N M2 branes, and 
obtain the action of N D2 branes [7, 50, 51]. We will see that this identification works also in our 
case. We should note that while the su(N) part is Hermitian, the u(1) part is anti-Hermitian. 
That is, an eigen-value distribution of X", ZA, and not X! determine the spacetime in the 
Hermitian model. In order to define light-cone coordinates, we need to perform Wick rotation: 
a° —» —ia®. After the Wick rotation, we obtain 


A® = Ao — ig (82) 


where A) is a su(N) Hermitian matrix. 


3.4. DLCQ Limit of 3-algebra model of M-theory 


It was shown that M-theory in a DLCQ limit reduces to the BFSS matrix theory with matrices 
of finite size [30-35]. This fact is a strong criterion for a model of M-theory. In [26, 28], it was 
shown that the Lie and Hermitian 3-algebra models of M-theory reduce to the BFSS matrix 
theory with matrices of finite size in the DLCQ limit. In this subsection, we show an outline 
of the mechanism. 


DLCQ limit of M-theory consists of a light-cone compactification, x ~ x~ + 27tR, where 
i = g(x" + x°), and Lorentz boost in x!° direction with an infinite momentum. After 


appropriate scalings of fields [26, 28], we define light-cone coordinate matrices as 


x? = F(x =x) 
x Fix" + X7) (83) 


We integrate out b" by using their equations of motion. 
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A matrix compactification [52] on a circle with a radius R imposes the following conditions on 


X~ and the other matrices Y: 
X- —(2nR)1=u'xX-u 
Y =utyu 


(84) 


where U is a unitary matrix. In order to obtain a solution to (84), we need to take N — oo and 
consider matrices of infinite size [52]. A solution to (84) is given by X~ = X~ + xX,Y=Y 


and 
01 0 
01 
U= 01 @ lnxn € U(N) (85) 
0 07 
Backgrounds X~ are 
X~ = —T3xp T° — (27R)diag(---,s—1,8,8+1,--+) @Inxn (86) 
in the Lie 3-algebra case, whereas 
(87) 


X- = -i(T’x-)1 — i(2nR)diag(--- ,s—1,s,5+1,--+) @lnxn 


in the Hermitian 3-algebra case. A fluctuation % that represents u(N) parts of X~ and Y is 


-X(-1) #0) (1) 
#(=2) #(—1) 20) #0) 2(2 
(—2) ) £(0) %(1) . (88) 


Each ¥(s) is an x n matrix, where s is an integer. That is, the (s, t)-th block is given by 
f., = ¥(s—t). 
We make a Fourier transformation, 


7 1 27R ist 
X(s) = se f dtx(T)e*R (89) 
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where x(T) is am X 1 matrix in one-dimension and RR = 27. From (86)-(89), the following 
identities hold: 


Paar 1 pe i(s—t)t 
[x ¢len= seh dt drx(T)e R (90) 


where tr is a trace over n x n matrices and V = ), 1. 


Next, we boost the system in x10 direction: 


a es 
xt as = Xr 
T 
Pr ae (91) 


The DLCQ limit is achieved when T — oo, where the "novel Higgs mechanism" [51] is 
realized. In T — ov, the actions of the 3-algebra models of M-theory reduce to that of the 
BFSS matrix theory [27] with matrices of finite size, 


ae Z py2_ 1p 972, 1 gro _ tap 
$= oy | atte(g (Dox?) — glx?, °F + 591 Doy — 590? (xp, ¥) (92) 
where P,Q = 1,2,--- ,9. 


3.5. Supersymmetric deformation of Lie 3-algebra model of M-theory 


A supersymmetric deformation of the Lie 3-algebra Model of M-theory was studied in [53] 
(see also [54—56]). If we add mass terms and a flux term, 


1 a 
Sm = (5 (x1? — 5 HET 3456 F + AiyK1 [x",x/, xX]x") (93) 
such that 


Hier = { —Ferxr (LJ, K,L = 3,4,5,6 or 7,8,9, 10) 
an 0 (otherwise) 


to the action (53), the total action Sq + Si is invariant under dynamical 16 supersymmetries, 


(94) 


ox! = iarl¥ 
Arg T 1=e lee. | 
oY = —2(x1, 1, X[Pijxe — Ayap[T*, T°, X"|T#T pe + pT 3456X'T ye (95) 


From this action, we obtain various interesting solutions, including fuzzy sphere solutions 
[53]. 


ie 
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4. Conclusion 


The metric Hermitian 3-algebra corresponds to a class of the super Lie algebra. By using this 
relation, the metric Hermitian 3-algebras are classified into u(m) @ u(n) and sp(2n) @ u(1) 
Hermitian 3-algebras. 


The Lie and Hermitian 3-algebra models of M-theory are obtained by second quantizations 
of the supermembrane action in a semi-light-cone gauge. The Lie 3-algebra model possesses 
manifest \’ = 1 supersymmetry in eleven dimensions. In the DLCQ limit, both the models 
reduce to the BFSS matrix theory with matrices of finite size as they should. 
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Algebraic Theory of Appell Polynomials 
with Application to General Linear 
Interpolation Problem 
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1. Introduction 


In 1880 P. E. Appell ([1]) introduced and widely studied sequences of n-degree polynomials 
An (x), n = 0,1)... (1) 
satisfying the differential relation 
DAy (x) = NAy_1(x), n = 1,2,... (2) 


Sequences of polynomials, verifying the (2), nowadays called Appell polynomials, have 
been well studied because of their remarkable applications not only in different branches 
of mathematics ([2, 3]) but also in theoretical physics and chemistry ([4, 5]). In 1936 an 
initial bibliography was provided by Davis ([6, p. 25]). In 1939 Sheffer ([7]) introduced a 
new class of polynomials which extends the class of Appell polynomials; he called these 
polynomials of type zero, but nowadays they are called Sheffer polynomials. Sheffer also 
noticed the similarities between Appell polynomials and the umbral calculus, introduced 
in the second half of the 19th century with the work of such mathematicians as Sylvester, 
Cayley and Blissard (for examples, see [8]). The Sheffer theory is mainly based on formal 
power series. In 1941 Steffensen ([9]) published a theory on Sheffer polynomials based on 
formal power series too. However, these theories were not suitable as they did not provide 
sufficient computational tools. Afterwards Mullin, Roman and Rota ([10—-12]), using operators 
method, gave a beautiful theory of umbral calculus, including Sheffer polynomials. Recently, 
Di Bucchianico and Loeb ([13]) summarized and documented more than five hundred old and 
new findings related to Appell polynomial sequences. In last years attention has centered on 
finding a novel representation of Appell polynomials. For instance, Lehemer ([14)]) illustrated 
six different approaches to representing the sequence of Bernoulli polynomials, which is a 
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special case of Appell polynomial sequences. Costabile ([15, 16]) also gave a new form of 
Bernoulli polynomials, called determinantal form, and later these ideas have been extended 
to Appell polynomial sequences. In fact, in 2010, Costabile and Longo ([17]) proposed an 
algebraic and elementary approach to Appell polynomial sequences. At the same time, Yang 
and Youn ([18]) also gave an algebraic approach, but with different methods. The approach 
to Appell polynomial sequences via linear algebra is an easily comprehensible mathematical 
tool, specially for non-specialists; that is very good because many polynomials arise in 
physics, chemistry and engineering. The present work concerns with these topics and it is 
organized as follows: in Section 2 we mention the Appell method ([1]); in Section 3 we provide 
the determinantal approach ([17]) and prove the equivalence with other definitions; in Section 
4 classical and non-classical examples are given; in Section 5, by using elementary tools of 
linear algebra, general properties of Appell polynomials are provided; in Section 6 we mention 
Appell polynomials of second kind ([19, 20]) and, in Section 7 two classical examples are given; 
in Section 8 we provide an application to general linear interpolation problem([21]), giving, in 
Section 9, some examples; in Section 10 the Yang and Youn approach ([18]) is sketched; finally, 
in Section 11 conclusions close the work. 


2. The Appell approach 


Let {An(x)},, be a sequence of n-degree polynomials satisfying the differential relation (2). 
Then we have 


Remark 1. There is a one-to-one correspondence of the set of such sequences {Ay(x)},, and the set of 
numerical sequences {an},,, Xo A 0 given by the explicit representation 


n 


An (x) = an + (1 


) ans + (| ) ana? =peeaie fe ax", n= 0, 1, on (3) 


Equation (3), in particular, shows explicitly that for each n > 1 the polynomial Ay (x) is 
completely determined by A,,_1 (x) and by the choice of the constant of integration a. 


Remark 2. Given the formal power series 


h h2 h" 
a(h) = ao + 7701 ater ale” tes, a9 #0, (4) 


with «; i = 0,1,... real coefficients, the sequence of polynomials, An(x), determined by the power series 
expansion of the product a (h) e!™, i.e. 


2 n 
ae? = Ay G+ ae) i Ap (x) poet An (2) inst y (5) 


satisfies (2). 


The function a (1) is said, by Appell, “generating function’ of the sequence {An (x)},,- 
Appell also noticed various examples of sequences of polynomials verifying (2). 


He also considered ([1]) an application of these polynomial sequences to linear differential 
equations, which is out of this context. 
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3. The determinantal approach 
Let be Bj € R, i=0,1,..., with Bo 4 0. 
We give the following 


Definition 1. The polynomial sequence defined by 


Ao (x) = By 
1 x x2 Si Had xn-l xt 
Bo Br Bo c++: Bn—1 Bn 
0 Bo (7) Bi aa Ca eee ()Bn—1 
Ane) = wt, |0 0 Po CNBe3 OPr2] pias, 
0 0 Bo (ny) B1 


is called Appell polynomial sequence for f;. 


Then we have 
Theorem 1. If Ay (x) is the Appell polynomial sequence for f; the differential relation (2) holds. 
Proof. Using the properties of linearity we can differentiate the determinant (6), expand the 


resulting determinant with respect to the first column and recognize the factor A;,_, (x) after 
multiplication of the i-th row by i — 1, i = 2,...,n and j-th column by i PH Ag 


Theorem 2. If Ay (x) is the Appell polynomial sequence for 6; we have the equality (3) with 


fie a (7) 
Br Bo elie, eae Bi-1 Bi 
Bo (G)Bi +++ (7) Bi-2 (1) Bi-1 
nai 0 Bo «r++: (51) Bi-3 (5) Bi-2 
wo - 
O sae «64.0 Bo (,/4)B1 
i-1/; 
- “ y (1) Bike, i= 1,2,...n. (8) 
k=0 


24 Linear Algebra — Theorems and Applications 


Proof. From (6), by expanding the determinant A, (x) with respect to the first row, we obtain 
the (3) with a; given by (7) and the determinantal form in (8); this is a determinant of an upper 
Hessenberg matrix of order i ([16]), then setting #; = (—1) (Bp)? «; fori = 1,2,...,n, we 


have 
i-1 ; 
= Y) (-1)°*" hey ie (i) Me, (9) 
k=0 
where: 
Bin for! = 1, 
tim = § (()Bm—t41 fort << m+1, Lm =1,2,..,i, (10) 
0 for] >m+1, 
i—k—1 
qx (i) = [I hjj-1 = (Bo) , k=01,...,i—2, (11) 
ja=k+2 
gii() =1. (12) 


By virtue of the previous setting, (9) implies 


aj = roy (.) Biz (Bo) ©? & + (, - :) Bia = 


ee ee ee 
= (-1)' (fo) ( fy 2 (x) \) 


and the proof is concluded. 


Remark 3. We note that (7) and (8) are equivalent to 


) Li=0 
ve | Bi-nee = { (13) 
0 (. 0i>0 
and that for each sequence of Appell polynomials there exist two sequences of numbers a; and B; related 
by (13). 
Corollary 1. If Ay (x) is the Appell polynomial sequence for B; we have 


n 


An (x) =o (") Ay; (0) x/, 1 =0,1,... (14) 


j=o J 


Proof. Follows from Theorem 2 being 


A; (0) =a;, i=0,1,...,n. (15) 


Remark 4. For computation we can observe that xn is a n-order determinant of a particular upper 
Hessenberg form and it’s known that the algorithm of Gaussian elimination without pivoting for 
computing the determinant of an upper Hessenberg matrix is stable ([22, p. 27]). 
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Theorem 3. If a(h) is the function defined in (4) and A, (x) is the polynomial sequence defined by 
(5), setting 


Xo 
n (16) 
1 
Bn = —s e (7) seb] , n=1,2,..,, 


we have that An (x) satisfies the (6), i.e. An(x) is the Appell polynomial sequence for B;. 


Proof. Let be 
h h h" 
blk) = Bot 5, Bit ap P2 bert Ba (17) 
with B,, as in (16). Then we have a (h) b (h) = 1, where the product is intended in the Cauchy 


sense, i.e.: 


Let us multiply both hand sides of equation 
h — hn 
a(h)e"* = iz An (x) ne (18) 
n=0 . 


hx 


1 5 : : 1 : : 
for ——~ and, in the same equation, replace functions e’* and ——~ by their Taylor series 


a(n) a (h) 
expansion at the origin; then (18) becomes 
co x" hn co h” co hh” 
= Vi An(x) — YS Bn. (19) 
XL n! n=0 n} n=0 n} 


By multiplying the series on the left hand side of (19) according to the Cauchy-product 
rules, previous equality leads to the following system of infinite equations in the unknown 
An (x),n=0,1,... 


Ao (x) Bo = 1, 
Apo (x) B1 + At (x) Bo = x, 


Ao (x) Bz + (7) At (x) Bi + A2 (x) Bo = 22, 
(20) 


Ao (x) Bu + (Aa (x) Bn—a +. + An (x) Bo = x", 
From the first one of (20) we obtain the first one of (6). Moreover, the special form of the 


previous system (lower triangular) allows us to work out the unknown Ay, (x) operating with 
the first n + 1 equations, only by applying the Cramer rule: 
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Bo 0 0 1 

Bi Bo O--- 0 x 

bo (Z)B1 fo-; 0 x 
An (2) = Gyr] 

Bn-1 Ce Pnet ae Bo a 

Bn (7)Bn—1 ens (n1)P1 x 


By transposition of the previous, we have 


Bo Bi Bo as Bn-1 Bn 
0 Bo (7) B1 a Cy baa (Pea 
Ay (x) = ae ae oe | | » REDS (21) 
00 0 + By (%)b1 
1x x2 -e- xt ait 


that is exactly the second one of (6) after n circular row exchanges: more precisely, the i-th row 
moves to the (i + 1)-th position for i = 1,...,n — 1, the n-th row goes to the first position. 


Definition 2. The function a(h) e, as in (4) and (5), is said ’generating function’ of the Appell 
polynomial sequence An (x) for Bj. 


Theorems 1, 2, 3 concur to assert the validity of following 


Theorem 4 (Circular). If Ay (x) is the Appell polynomial sequence for 6; we have 
(6) = (2) = (3) = (5) > ©). 


Proof. 
(6)=(2): Follows from Theorem 1. 


(2)=(3): Follows from Theorem 2, or more simply by direct integration of the differential 
equation (2). 


(3 )=(5): Follows ordering the Cauchy product of the developments a(h) and e!* with 
respect to the powers of h and recognizing polynomials A; (x), expressed in form (3), as 


coefficients of 77. 


(5)=(6): Follows from Theorem 3. 


Remark 5. In virtue of the Theorem 4, any of the relations (2), (3), (5), (6) can be assumed as definition 
of Appell polynomial sequences. 
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4. Examples of Appell polynomial sequences 


The following are classical examples of Appell polynomial sequences. 


a) Bernoulli polynomials ([17, 23]): 


1 ‘ 

Pe qe = 0,1c5; (22) 

h 
h) = ——; 23 
a(h) = 5 (23) 

b) Euler polynomials ([17, 23]): 
Ll 4 

Bo 1, Bi oy 1 Ds (24) 

2 
h) = ——,; 25 
a(t) = 5 (25) 


c) Normalized Hermite polynomials ([17, 24]): 


1 to on | 0 for i odd ; 
Bi = ae e*~ x'dx = (-1)(i-3) wa pean pene i=0,1)..., (26) 
23 
2 
a(h) = eT; (27) 
d) Laguerre polynomials ([17, 24]): 

+00 

B; a) e*xidx=T(i+1)=i, i=0,1).., (28) 
0 

a(h) =1-h; (29) 


The following are non-classical examples of Appell polynomial sequences. 


e) Generalized Bernoulli polynomials 
¢ with Jacobi weight ([17]): 


1 P j T(a+1)T(p+i4+1 . 
Bi= f (1— x) xP xidx = et ) a,B>—1, i=0,1,.., (30) 
a(h) = : (31) 
jae = x)*xBehx dy’ 
¢ of order k ([11]): 
k 
Bi = (3) , kinteger, i=0,1,..., (32) 


a(h) = (a4): (33) 


28 Linear Algebra — Theorems and Applications 


f) Central Bernoulli polynomials ([25]): 


1 
pai = 4" 
foi41 = 0, 1=0,1,..., (34) 
h 
g) Generalized Euler polynomials ([17]): 
Bo =1, 
eee i 
pi = ao W1,W2 >0, i=1,2,..., (36) 
WwW + W2 

h) = ——_*; 37 
sO aerae rare G7) 


h) Generalized Hermite polynomials ([17]): 


+00 a5 
Bi : [ ell x!dx 


~ Va. 
0 for i odd i=0,1,..., 
~ =. (2) forieven’ a>0O, (38) 
Jt 
h) = z : 39 
a feel ettdx ie 
i) Generalized Laguerre polynomials ([17]): 
ve HX yd 
bi = i e “*x'dx 
T(@+1) i! ; 
= a>0O, i1=0,1,..., (40) 
a(h) =a—h. (41) 


5. General properties of Appell polynomials 


By elementary tools of linear algebra we can prove the general properties of Appell 
polynomials. 


Let An (x), n = 0,1,...,be a polynomial sequence and 6; € R, i =0,1,..., with Bo £0. 
Theorem 5 (Recurrence). Ay (x) is the Appell polynomial sequence for B; if and only if 


n—1 
An(x) = 7 (« = ({,) Brn ), n= 1,2,.. (42) 


k=0 
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Proof. Follows observing that the following holds: 


1 x x2 eee eee xn-l xn 
Bo Bi Bo vt Bn-1 Bn 
0 Bo ()Bi<++ ++ Cr ene eA 
An (1) = EM] OO Bo vr C21 Bas Graal 
Oras 2a in Bo Cy wP1 
n—1 
- 3 @ 7 2 G By_4Ag 9] . Se (43) 


In fact, if Ay (x) is the Appell polynomial sequence for f;, from (6), we can observe that A, (x) 
is a determinant of an upper Hessenberg matrix of order n + 1 ([16]) and, proceeding as in 
Theorem 2, we can obtain the (43). O 


Corollary 2. If An (x) is the Appell polynomial sequence for B; then 


n 


xP = s e Bn—KAk (x), n=0,1,... (44) 


k=0 


Proof. Follows from (42). 


Corollary 3. Let P,, be the space of polynomials of degree < nand {Ayn(x)},, be an Appell polynomial 
sequence, then {Ay(x)},, is a basis for Pn. 


Proof. If we have 
n 
Py (x) = iz Angx®, Ank ER, (45) 
k=0 


then, by Corollary 2, we get 
n k k n 
Pals) = 32 dn Do (4) Be-j4 (2) = ene An, 
k=0 ~— j=0 J k=0 
where 


n—k k+i 
Cok = ms ( -) A+ ;Bj- (46) 


j=0 


Remark 6. An alternative recurrence relation can be determined from (5) after differentiation with 
respect to h ([18, 26]). 
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Let be B;, 7; € R, i=0,1,..., with Bo, Yo £0. 

Let us consider the Appell polynomial sequences Ay (x) and B, (x), n = 0,1,..., for Bj and 7;, 
respectively, and indicate with (AB),, (x) the polynomial that is obtained replacing in Ay (x) 
the powers x9, x4, xt, respectively, with the polynomials Bg (x) , By (x),..., Bn (x) . Then we 
have 


Theorem 6. The sequences 


i) AAn (x) +pBn (x), AMER, 


it) (AB), (x) 


are sequences of Appell polynomials again. 


Proof. i) Follows from the property of linearity of determinant. 


ii) Expanding the determinant (AB),, (x) with respect to the first row we obtain 


= £0" FS Kayicg.y ()z,_.B (x) = 
(AB), (x) (Bo) XI 1)’ (Bo) ) n—jBj (x) 


aoe ee 
where 
® = 1, 
Pi Pe Genet Bi-1Bi 
Bo G)py ees (7)Bi-2 G) Bina 
0 fo: ee (5")Bi-3 (5) Bi-2 
a; = : : , 221,205: 
0 - 0 Bo (44) B1 
We observe that 
yy! _ 
Aj (0) = ave iy ESM 
and hence (47) becomes 
(AB), (x) = ¥ (4) Any (0) Bj (2). (a8) 


j=0 


Differentiating both hand sides of (48) and since B; (x) isa sequence of Appell polynomials, 
we deduce 


((AB),, (x))’ = (AB),_y (x). (49) 
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Let us, now, introduce the Appell vector. 

Definition 3. If A; (x) is the Appell polynomial sequence for B; the vector of functions Ay (x) = 
[Ao (x), --, An(x)]* is called Appell vector for B;. 

Then we have 


Theorem 7 (Matrix form). Let Ay (x) be a vector of polynomial functions. Then Ay (x) is the Appell 
vector for B; if and only if, putting 


ihe. . A : 
(M);, = fe Pine ’ oe ee ee 
and X(x) = [1,x,...,x"]" the following relation holds 
X(x) = MAy (x) (51) 
or, equivalently, 
An (x) = (mt) X(x), (52) 


being M~! the inverse matrix of M. 


Proof. If Ay (x) is the Appell vector for 8; the result easily follows from Corollary 2. 
Vice versa, observing that the matrix M defined by (50) is invertible, setting 


Ne, idj 
(mt) = (ei; 12] i,j =0,...,n; (53) 
Uy] 


0 otherwise ’ 


we have the (52) and therefore the (3) and, being the coefficients a, and fh; related by (13), we 
have that A;,(x) is the Appell polynomial sequence for fj. 


Theorem 8 (Connection constants). Let Ay(x) and B,(x) be the Appell vectors for B; and ¥;, 
respectively. Then 


An(x) = CBy(x), (54) 
where ; 
Gey; iz] ; 
Chae Yel ; 7, =0,...,N. 55 
( dij 0 otherwise ad . 2) 
with 


Cn = y (i) On —KYVk- (56) 


Proof. From Theorem 7 we have 


with M as in (50) or, equivalently, 


An (x) = (m7) X(x), 
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with M~! as in (53). 
Always from Theorem 7 we get 
with 


Ny i>; 
Nin (i) Vij ie oe 
0 otherwise 


Then 
An (x) = M7!NB, (x), 
from which, setting C = M—!N, we have the thesis. 


Theorem 9 (Inverse relations). Let Ay (x) be the Appell polynomial sequence for 6; then the 
following are inverse relations: 


n (57) 
n 
xn = = k An—K(0)Yk- 
k=0 
Proof. Let us remember that 
Ax(0) = a, 
where the coefficients a, and fh, are related by (13). 
Moreover, setting ¥,, = [Yo, Yn]? and X¥, = [Xo,--, Xn], from (57) we have 
{e = MiXn 
Xn = Moy, 
with : 
G)Bi-; i 2] ay 
My);, =< V ] j ,] =0,...,n, 58 
( Vij 0 otherwise ad m (8) 
()a;; ij a 
M>).,=% VI , 7 =0,...,1, 59 
( Vij 0 otherwise ta " (9) 


and, from (13) we get 
MM) = In41, 


i.e. (57) are inverse relations. 


Theorem 10 (Inverse relation between two Appell polynomial sequences). Let A,(x) and 
B,,(x) be the Appell vectors for B; and y;, respectively. Then the following are inverse relations: 


a (60) 
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0 otherwise 0 otherwise 


(C)ij = e-j oe (Cc) - ()e-j ae i,j =0,.,N, 
: ij 


cn = : (i) An-K(0) Ver Cn = s (i) By—«(0)Br- 


k=0 k=0 


Proof. Follows from Theorem 8, after observing that 
. (ie = ={5 n=0 
n—k*k — 
ras k 0n>0 


ce = dn+1- 


and therefore 


Theorem 11 (Binomial identity). If Ay (x) is the Appell polynomial sequence for 6; we have 
n 


An (x+y) =) (‘) Aj(x)y""i, 1 =0,1).. 


i=0 


Proof. Starting by the Definition 1 and using the identity 


k=0 
we infer 
1 (x+y)! ---(x+y)" (x+y)” 
Bo Br o-> Buea Bn 
An(x+y) =| 0 = 
0 Bo Bal") 
() (+1) x1 (42) x2 _ (rao Fae (2)x"-i 
Bo Bis") Bo(%?) «++ Bnina("7") Bil) 
Sm (eam [0 Bo BAGGED > Brea) Banat 


Bo (Bol "| Bo 


(61) 


(62) 


(63) 


(64) 


(65) 
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We divide, now, each j—th column, j = 2,...,.1 —i+1, for (ens *) and multiply each h—th row, 
h =3,...,.n—i+1, for geome Thus we ae obtain 
An (x+y) = 
2. (yes @) 2 Ei 


_ L eae Ay (pare 
1 xi x2 ea ynt-i-l xn-i 
Bo Bi Po, Mee Bn— i-1 Bn— i 
0 Bo Bit es » Ba i- 2(" 4 : *) Bn- i- ("7) 
: Bo = 
Ou. cae 0 Bo BiG. i) 
n /n ‘ n/n eS 
=¥ [Arie (Hac 


Theorem 12 (Generalized Appell identity). Let An(x) and By,(x) be the Appell polynomial 
sequences for B; and ;, respectively. Then, if C,(x) is the Appell polynomial sequence for 5; with 


i4 
Co(0)7 ss 
Oe ao & (eee (Ot SI a 
and 
Cuy= ys, (;) B;_;(0)A;(0), 67) 
j=0 


nly +2) = = iG ) Ac(y)Bn-u(2) (68) 


Proof. Starting from (3) we have 


n 


Coy 2) = (2) Gr-nOy +2) (69) 


k=0 


Then, applying (67) and the well-known classical binomial identity, after some calculation, we 
obtain the thesis. 


Theorem 13 (Combinatorial identities). Let An(x) and By (x) be the Appell polynomial sequences 
for B; and ¥;, respectively. Then the following relations holds: 
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Xe (;) Ae(2Bra(—2) = (7) Ail 0)Bs 110) (70) 
D (f) Ae(2)Brel@) = Bo (7) ale +2)B 400) 7) 


Proof. If Cy(x) is the Appell polynomial sequence for 6; defined as in (66), from the 
generalized Appell identity, we have 


y (i) Ax(x)Bn—K(—x) = Cu(0) = y @ Ax (0) Bn—¢ (0) 


and 
n 
» 


k=0 


(;) Ay(x)By_4(Z) = Cn(x +z) = y (i) Ax(x + 2)Bn—x(0). 


k=0 


Theorem 14 (Forward difference). If Ay (x) is the Appell polynomial sequence for B; we have 


n—1 n 
AAn (x) = An (x +1) —An(x) = VO (7) Ai (yy n=O (72) 
i=0 


Proof. The desired result follows from (64) with y = 1. 


Theorem 15 (Multiplication Theorem). Let A,,(x) be the Appell vector for B;. 
The following identities hold: 


An (mx) = B(x)An (x) n=0,1,..., m=1,2,..., (73) 
An (mx) = M—'DX(x) n=0,1,.. m=1,2,.., (74) 
where ; tee 
(‘)\(m—1jiixi-i i> j . 
B 2. f V Z ,, =0,...,N, 75 
(B(x));, 0 otherwise’ "1 ~~" 


D = diag{1,m, ...,m"| and M~! defined as in (53). 


Proof. The (73) follows from (64) setting y = x (m— 1). In fact we get 
n n . iz 
An (mx) =) (7) A (x) (m—1)" x", (76) 
i=0 
The (74) follows from Theorem 7. In fact we get 


An(mx) = M~!X(mx) = M~'DX(x), (77) 
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and 


An (mx) = 0 (7) agi! (78) 


Theorem 16 (Differential equation). If A, (x) is the Appell polynomial sequence for B; then An (x 
satisfies the linear differential equation: 


FY (2) + PEE yO (x) + + Fey 2) + Bry (8) + Boyl) = 9) 


Proof. From Theorem 5 we have 


1 no (n+1 
Any1(x) = Bo Ge = > @ 1) Beery «()) : (80) 
k=0 
From Theorem 1 we find that 
(k) 
! _ a An (x) 
Anyi(%) — (n+1)An(x), and An—x(X) — n(n —1)...(n—k +1)’ (81) 
and replacing A;,_;(x) in the (80) we obtain 
oh pt peg Oe, Ae @ 
An4i(x) Bo ( (n l 1) De Best “= 1)! : (82) 


Differentiating both hand sides of the last one and replacing A’, , (x) with (n + 1) An(x), after 
some calculation we obtain the thesis. 


Remark 7. An alternative differential equation for Appell polynomial sequences can be determined by 
the recurrence relation referred to in Remark 6 ([18, 26]). 


6. Appell polynomial sequences of second kind 
Let f : 1 C IR — Rand A be the finite difference operator ([23]), ie.: 
Alf](x) = f(x +1) — f(x), (83) 
we define the finite difference operator of order i, with i € IN, as 
aiff) = aL) = LD) fe +9, (84) 
j=0 


meaning A° = ITand A! = A, where I is the identity operator. 
Let the sequence of falling factorial defined by 


(x)o =1, 
a (85) 
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we give the following 


Definition 4. Let 8; € IR, i = 0,1,..., with By 4 0. The polynomial sequence 


Bo’ 
1 (x)4 (x) ee sys (x)n—1 (X)n 
Bo By Bo ae Bai Br 
0 Bo CG i eee CT) n—2 (Bn 
aye]? 2 BOSE] wary 
ona Rey 


is called Appell polynomial sequence of second kind. 
Then, we have 
Theorem 17. For Appell polynomial sequences of second kind we get 


ANAn (x) = nNAn-1 (x) n=1,2,... (87) 


Proof. By the well-known relation ([23]) 
A(x), =N(X),-1, 1=1,2,.., (88) 


applying the operator A to the definition (86) and using the properties of linearity of A we 
have 


AL A(x); A(x)gs++ ++ A(X)n-1 A(*)n 
Bo By Bo a Bui Br 
0 Bo Gy crt Cy he CBs 
AAn (x) = a 0 0 fy vt 2 rs M2] na 2,.. 99) 
0 


We can expand the determinant in (89) with respect to the first column and, after multiplying 
the i-th row by i — 1, 7 = 2,...,n and the j-th column by i j = 1,...,n, we can recognize the 
factor A, (x). 


We can observe that the structure of the determinant in (86) is similar to that one of the 
determinant in (6). In virtue of this it is possible to obtain a dual theory of Appell polynomials 
of first kind, in the sense that similar properties can be proven ([19]). 
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For example, the generating function is 
H(x,h) =a(h)(1+h)*, (90) 


where a(/) is an invertible formal series of power. 


7. Examples of Appell polynomial sequences of second kind 


The following are classical examples of Appell polynomial sequences of second kind. 


a) Bernoulli polynomials of second kind ([19, 23]): 


Te a 
B= a4 t=O Lp, (91) 
H(x,h) = asl (92) 
“7 In(l +h)’ 
b) Boole polynomials ([19, 23]): 
1,i1=0 
B=) 5 i=1 (93) 
OF ea ee 
_ 20 +h)* 
H(x,h) = pay (94) 


8. An application to general linear interpolation problem 


Let X be the linear space of real functions defined in the interval [0,1] continuous and with 
continuous derivatives of all necessary orders. Let L be a linear functional on X such that 
L(1) # 0. If in (6) and respectively in (86) we set 


pi = LG), B; = L((x);), i=0,1,..., (95) 


A,(x) and A, (x) will be said Appell polynomial sequences of first or of second kind related 
to the functional L and denoted by A; ,(x) and Aj, ,,(x), respectively. 


Remark 8. The generating function of the sequence Aj, ,(x) is 


xh 
G(x,h) = teat’ (96) 
and for Ar n(x) is 
— (1+h)* 


where Ly means that the functional L is applied to the argument as a function of x. 


. 1 oo i 
Proof. For Azn(x) if G(x,h) = a(h)e*" with ae) =>yp Bit we have 
i=0 . 
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exh exh exh exh exh 
G(x, t) To hi ~ “oo id = ar = Lye)" 
am | Pi ELV L(X +H) 
i=0 i=0 i=0 


For A;,,(x), the proof similarly follows. 


Then, we have 


Theorem 18. Let w; € R,i = 0,...,n, the polynomials 


Px(x) = Yo Ar (2), (98) 
i=0 ~ 

Pa(x) = OAL (2) (99) 
i=0 " 


are the unique polynomials of degree less than or equal to n, such that 
L(P) = itw;, i =0,...,n, (100) 


L(A'P*) = ilw;, i=0,...,n. (101) 


Proof. The proof follows observing that, by the hypothesis on functional L there exists a 
unique polynomial of degree < n verifying (100) and , respectively, (101); moreover from 
the properties of A, ;(x) and A, ;(x), we have 


L(AW)(x)) = i(i-1)...(i— 7 + 1)L(Ari_j(2)) =}! (;) bi, (102) 
L(A‘ALj(x)) = i(i— 1)..@— f+ I)L(ALj_j(x)) = 3! (;) bij, (103) 


where 6;; is the Kronecker symbol. 


From (102) and (103) it is easy to prove that the polynomials (98) and (99) verify (100) and 
(101), respectively. 


Remark 9. For every linear functional L on X, {A,i(x)}, {AL i(x)}, i = 0,...,n, are basis for Py 
and, 1 Pn(x) € Py, we have 


n (i) 
Pa(x) = EP") ay (x), (104) 
i=0 . 
Pa(x) = HOP) ay (2), (105) 
i=0 : 


Let us consider a function f € X. Then we have the following 
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Theorem 19. The polynomials 


n (i) 
Pualflle) = YE A), (106) 
Piglflls) = eT? Ava) (107) 


Proof. Setting w; = Lf) , and respectively, w; = 


Theorem 18. 


7, 1 = 0,...,n, the result follows from 


Definition 5. The polynomials (106) and (107) are called Appell interpolation polynomial for f of first 
and of second kind, respectively. 


Now it is interesting to consider the estimation of the remainders 


Rinlfl(x) = f(x) — Pualfl(x), Vx € [0,1], (108) 
Ri alfl(*) = f(x) — Poalfl(x), Vx € [0,1]. (109) 
Remark 10. For any f € Py 
Rienlfl(x) =0, Renlx"**] £0, Vx € [0,1], (110) 
Rinlfl(®) =O Ri vl()n+1] £0, Vx € [0,1], (111) 


i. e. the polynomial operators (106) and (107) are exact on Py. 


For a fixed x we may consider the remainder R_,,,[f|(x) and Rj ,,[f](x) as linear functionals 
which act on f and annihilate all elements of P;,. From Peano’s Theorem ([27, p. 69]) if a linear 
functional has this property, then it must also have a simple representation in terms of f("+1). 
Therefore we have 


Theorem 20. Let f € C"*! [a,b] , the following relations hold 


Runlfix) =f Kula t)fD (at, Ye € (01), te) 


tb Pape tee 
Rialf) == f Kit fe4) (at, Vx € [0,1], (113) 
where 


Kn (x8) = Ren [(x- 84] = (2-H - (‘) L((x-#)%") Api(x), (14) 
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‘ E (ai(x - ty) ede 


Ki (x,t) = Rp, ((x—#)4] = (e- 8h i! 


i=0 


Proof. After some calculation, the results follow by Remark 10 and Peano’s Theorem. 


(115) 


Remark 11 (Bounds). If f("+!) € LP[0,1] and Ky (x,t), K* (x,t) € £4[0,1] with 7 + i = 1 then 


we apply the Holder’s inequality so that 


Ralls & (fo Kalo a)’ (f jer colar) 
IRt ALfI(x)| < - Ch |K* (x,t) yar) (f r (n-+1) co) at)" 


The two most important cases are p = g = 2andq=1,p=o: 


i) for p = q = 2 we have the estimates 


[Ron Ll(x)| < on UNIAN, [Rin LAN(&)] < om IILAIIL, 


where 


and 


HAP = f (40) @) at 


ii) for g = 1, p = © we have that 


1 1 1 1 
Runlf) < pM ff Mux, dt, IRE nLA=)| SMa fii (x. dt, 


where 
Mn+1 = sup iF ve ) (x). 


a<x<b 


A further polynomial operator can be determined as follows: 
for any fixed z € [0,1] we consider the polynomial 


= n (i) 
Prnlfl(x) = f(2) + Poalfl() — Poalfl@ = fle) + DE? (aria) - Avil), 


and, respectively, 


Pi alfl(x) = Fl2) + Pal f(x) — Pipl =f) + OP (aria) - ALi). 


(116) 


(117) 


(118) 


(119) 


(120) 


(121) 


(122) 
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Then we have the following 


Theorem 21. The polynomials Py, [f|(x), Pin [f] (x) are approximating polynomials of degree n for 
f (x), ie: 


Vee [0,1], f(%) =Pralfl@)+Rralfl(e), (123) 
f(x) = Pralfl(x) + Rialfl (x), (124) 
where 

Rynlfl(x) = Rralfl(x) — Rialfl(z), (125) 
Ren lfl(%) = Ri nlfl(*) — RF alfl(2), (126) 

with 
Ryalx']=0, i=0,.,n, Rzy[x**'] £0, (127) 
Rie) =o $= “Ki gl @naal 0. (128) 


Proof. Vx € [0,1] and for any fixed z € [0,1], from (108), we have 


f(x) — F(Z) = Pralfl (x) — Pealfl(@) + Ronlfl(*) — Roalfl(2), 


from which we get (123) and (125). The exactness of the polynomial P;, ,,[f](x) follows from 
the exactness of the polynomial P,, ,[f] (x). 


Proceeding in the same manner we can prove the result for the polynomial Pin [f] (x). 


Remark 12. The polynomials P,, ,[f](x), Pin [f](x) satisfy the interpolation conditions 


Pralfl() =f(2), LPL) = Lf), i=1)..0, (129) 


Piafl@=fee DAP al) SuA pf, 1san (130) 


9. Examples of Appell interpolation polynomials 


a) Taylor interpolation and classical interpolation on equidistant points: 
Assuming 
L(f) =f(xo), x0 € [0,1], (131) 
the polynomials Py,,[f](x) and P.,[f](x) are, respectively, the Taylor interpolation 
polynomial and the classical interpolation polynomial on equidistant points; 


b) Bernoulli interpolation of first and of second kind: 
¢ Bernoulli interpolation of first kind ([15, 21]): 
Assuming 


L(f) = | f(x)dx, (132) 
the interpolation polynomials P;,,[f](x) and Pr,n[f](x) become 


(0a) - Ff VO) 


i! 


Pialflle) = ff soae+ 7 B(x), 133) 


Algebraic Theory of Appell Polynomials with Application to General Linear Interpolation Problem 43 


i— lio 
Prnlfl(x Ds peewee © (30), 034 
where B;(x) are the classical Bernoulli polynomials ([17, 23]); 
¢ Bernoulli interpolation of second kind ([19]): 
Assuming 
L(f)=[Datf] (135) 
x=0 
where A~! denote the indefinite summation operator and is defined as the linear 
operator inverse of the finite difference operator A, the interpolation polynomials 
Pr ,Lf](x) and Pin [f](x) become 


Pr nlfl(x) = [A“'Dflx= +E 0 (Bui (x), (136) 

Pr nlfl(z) = f(0) + vr ) (Bik: (=) — Bri (0)), (137) 
where : _ 
n— i\ (_1)J-1 

Bix) = (‘) 5 ye (x), (138) 


j=i 
and Bi (x) are the Bernoulli polynomials of second kind ([19]); 


c) Euler and Boole interpolation: 
¢ Euler interpolation ([21]): 


Assuming 
_ fO)+FfM) 
L(f) =, (139) 
the interpolation polynomials P;,,[f](x) and Pz,n[f](x) become 
(i) 2) 
i=] . 
= ne f(i) (i) 
Pralfle) =f+E 2 OFF O ce (-e 0); aay 
i=1 7 
¢ Boole interpolation ([19]): 
Assuming 
L(f) = [Mf],-0, (142) 
where Mf is defined by 
Mf (x) = ae iae (143) 


the interpolation polynomials P7',,[f](x) and = Lf] (x) become 


i= 
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Pi nlfl(x) = f(0) 4 y FO+ HEY (ett(yy — ett(0)), (145) 
where a - 
Eni) = (‘) — EV), (146) 


ji 
and Et '(x) are the Boole polynomials ([19]). 


10. The algebraic approach of Yang and Youn 


Yang and Youn ([18]) also proposed an algebraic approach to Appell polynomial sequences 
but with different methods. In fact, they referred the Appell sequence, s;(x), to an invertible 


analytic function g(t): 
Sn(x) = call 1 gt (147) 
: dt \ g(t) eae 


and called Appell vector for g(t) the vector 


Sn (x) = [so(x), ---Sn(x)]? - (148) 

Then, they proved that 
fay es xt) Fa | 149 
es ls ho Ee Jo " Om 10. ue) 


being Wh [f(f)] = [FOF Or fH] and P,[f(t)] the generalized Pascal functional 
matrix of f(t) ([28]) defined by 


FOO iB] 4 
Prlf(t)));, =< Y F A = Oyacyins 150 
(Pal fC Wis 0 otherwise a e en) 
Expressing the (149) in matrix form we have 
Sn(x) = SX(x), (151) 
with 
S00 0 0.:--- 0 
Ee $11 0--- 0 
S = | $20 $21 822°°° O |} X(x) = [1,x,.,27]7, (152) 
Sn0 Sn1 Sn2 °° Snn 
where 


() [aT Ftem rete as 
sij= |. — j 1=0,..,n, J =0,...,1. 
i} | \gtt) = 


It is easy to see that the matrix S coincides with the matrix M7! introduced in Section 5, 
Theorem 7. 
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11. Conclusions 


We have presented an elementary algebraic approach to the theory of Appell polynomials. 
Given a sequence of real numbers f;, i = 0,1,..., Bg 4 0, a polynomial sequence on 
determinantal form, called of Appell, has been built. The equivalence of this approach with 
others existing was proven and, almost always using elementary tools of linear algebra, 
most important properties od Appell polynomials were proven too. A dual theory referred 
to the finite difference operator A has been proposed. This theory has provided a class of 
polynomials called Appell polynomials of second kind. Finally, given a linear functional L, 
with L(1) 4 0, and defined 

L(x!) = Bi, (L((x)i) =B;), (154) 


the linear interpolation problem 
L(P) = ite, (L(A'Ph) = itwi) Pn € Pn, wR, (155) 


has been considered and its solution has been expressed by the basis of Appell polynomials 
related to the functional L by (154). This problem can be extended to appropriate real 
functions, providing a new approximating polynomial, the remainder of which can be 
estimated too. This theory is susceptible of extension to the more general class of Sheffer 
polynomials and to the bi-dimensional case. 
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Chapter 3 


An Interpretation of Rosenbrock’s Theorem 
via Local Rings 


A. Amparan, S. Marcaida and |. Zaballa 
Additional information is available at the end of the chapter 
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1. Introduction 


Consider a linear time invariant system 
x(t) = Ax(t) + Bu(t) (1) 


to be identified with the pair of matrices (A,B) where A € F”*", B € F”*™ and F = R 
or C the fields of the real or complex numbers. If state-feedback u(t) = Fx(t) + v(t) is 
applied to system (1), Rosenbrock’s Theorem on pole assignment (see [14]) characterizes for 
the closed-loop system 

x(t) = (A+ BF)x(t) + Bo(t), (2) 
the invariant factors of its state-space matrix A + BF. This result can be seen as the solution 
of an inverse problem; that of finding a non-singular polynomial matrix with prescribed 
invariant factors and left Wiener—Hopf factorization indices at infinity. To see this we recall 
that the invariant factors form a complete system of invariants for the finite equivalence of 
polynomial matrices (this equivalence relation will be revisited in Section 2) and it will be seen 
in Section 4 that any polynomial matrix is left Wiener-Hopf equivalent at infinity to a diagonal 
matrix Diag(s", ie .skm), where the non-negative integers k,,...,k (that can be assumed 
in non-increasing order) form a complete system of invariants for the left Wiener-Hopf 
equivalence at infinity. Consider now the transfer function matrix G(s) = (s]— (A+BF))~'B 
of (2). This is a rational matrix that can be written as an irreducible matrix fraction description 
G(s) = N(s)P(s)~!, where N(s) and P(s) are right coprime polynomial matrices. In the 
terminology of [18], P(s) is a polynomial matrix representation of (2), concept that is closely 
related to that of polynomial model introduced by Fuhrmann (see for example [8] and the 
references therein). It turns out that all polynomial matrix representations of a system are right 
equivalent (see [8, 18]), that is, if P}(s) and P2(s) are polynomial matrix representations of the 
same system there exists a unimodular matrix U(s) such that P)(s) = P;(s)U(s). Therefore 
all polynomial matrix representations of (2) have the same invariant factors, which are the 
invariant factors of sI, — (A+ BF) except for some trivial ones. Furthermore, all polynomial 
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matrix representations also have the same left Wiener— Hopf factorization indices at infinity, 
which are equal to the controllability indices of (2) and (1), because the controllability indices 
are invariant under feedback. With all this in mind it is not hard to see that Rosenbrock’s 
Theorem on pole assignment is equivalent to finding necessary and sufficient conditions for 
the existence of a non-singular polynomial matrix with prescribed invariant factors and left 
Wiener-Hopf factorization indices at infinity. This result will be precisely stated in Section 5 
once all the elements that appear are properly defined. In addition, there is a similar result to 
Rosenbrock’s Theorem on pole assignment but involving the infinite structure (see [1]). 


Our goal is to generalize both results (the finite and infinite versions of Rosenbrock’s Theorem) 
for rational matrices defined on arbitrary fields via local rings. This will be done in Section 5 
and an extension to arbitrary fields of the concept of Wiener-Hopf equivalence will be needed. 
This concept is very well established for complex valued rational matrix functions (see for 
example [6, 10]). Originally it requires a closed contour, ‘y, that divides the extended complex 
plane (C U {o0}) into two parts: the inner domain (Q.,) and the region outside y (Q_), which 
contains the point at infinity. Then two non-singular m x m complex rational matrices Tj (s) 
and T>(s), with no poles and no zeros in 7, are said to be left Wiener-Hopf equivalent with 
respect to ¥ if there are m x m matrices U_(s) and U,(s) with no poles and no zeros inQ- UY 
and ©, U 7, respectively, such that 


Tp(s) = U_(s)T;(s)Uy(s). (3) 


It can be seen, then, that any non-singular m x m complex rational matrix T(s) is left 
Wiener—Hopf equivalent with respect to 7 to a diagonal matrix 


Diag ((s a z0)!) bs 


where Zo is any complex number in 0.4 and ky > --- > km are integers uniquely determined 
by T(s). They are called the left Wiener—Hopf factorization indices of T(s) with respect to 
¥ (see again [6, 10]). The generalization to arbitrary fields relies on the following idea: We 
can identify O4 Uy and (Q_ U7) \ {co} with two sets M and M’, respectively, of maximal 
ideals of C[s]. In fact, to each z9 € C we associate the ideal generated by s — zo, which is 
a maximal ideal of C[s]. Notice that s — zp is also a prime polynomial of C[s] but M and 
M', as defined, cannot contain the zero ideal, which is prime. Thus we are led to consider 
the set Specm(C|s]) of maximal ideals of C[s]. By using this identification we define the 
left Wiener-Hopf equivalence of rational matrices over an arbitrary field F with respect to 
a subset M of Specm(F[s]), the set of all maximal ideals of F[s]. In this study local rings 
play a fundamental role. They will be introduced in Section 2. Localization techniques 
have been used previously in the algebraic theory of linear systems (see, for example, [7]). 
In Section 3 the algebraic structure of the rings of proper rational functions with prescribed 
finite poles is studied (i-e., for a fixed M C Specm(F[s]) the ring of proper rational functions 


et with gcd(g(s),7(s)) = 1 for all (7(s)) € M). It will be shown that if there is an ideal 
generated by a linear polynomial outside M then the set of proper rational functions with 
no poles in M is an Euclidean domain and all rational matrices can be classified according 
to their Smith-McMillan invariants. In this case, two types of invariants live together for 
any non-singular rational matrix and any set M C Specm(F[s]): its Smith-McMillan and left 
Wiener—Hopf invariants. In Section 5 we show that a Rosenbrock-like Theorem holds true that 
completely characterizes the relationship between these two types of invariants. 
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2. Preliminaries 


In the sequel F[s] will denote the ring of polynomials with coefficients in an arbitrary field F 
and Specm(F[s]) the set of all maximal ideals of F[s], that is, 


Specm(F[s]) = {(7(s)) : 7(s) € F[s], irreducible, monic, different from 1}. (5) 


Let 71(s) € F[s] be a monic irreducible non-constant polynomial. Let S = F[s] \ (71(s)) be the 
multiplicative subset of F[s] whose elements are coprime with 71(s). We denote by F7(s) the 
quotient ring of F[s] by S; i.e., S~MF[s]: 


Fx(s) = {25 : p(s) als) € FIs] gcd(a(s),m(s)) = 1}. 6) 
This is the localization of F[s] at (7t(s)) (see [5]). The units of F,,(s) are the rational functions 
u(s) = 2S} such that ged (p(s), 7(s)) = 1 and ged(q(s), 72(s)) = 1. Consequentially, 
F,(s) = {u(s)m(s)4 : u(s) is a unit and d > of U {0}. (7) 
For any M C Specm(E{s}), let 
Fm (s) = 1\(n(s))em Fa(s) 


= {2 : p(s),q(s) € Fis], gcd(q(s), 7(s)) = 1 ((s)) € M}. 


(8) 


This is a ring whose units are the rational functions u(s) = 48) such that for all ideals (7r(s)) € 


M, gcd(p(s), 7(s)) = land ged(q(s), 7(s)) = 1. Notice that, in particular, if M = Specm(F[s]) 
then Fy4(s) = F[s] and if M = © then Fy4(s) = F(s), the field of rational functions. 


Moreover, if «(s) € F[s] is a non-constant polynomial whose prime factorization, a(s) = 
ko (s)"! +++ a&m(s)4", satisfies the condition that («;(s)) € M for all i, we will say that a(s) 
factorizes in M or «(s) has all its zeros in M. We will consider that the only polynomials that 
factorize in M = @ are the constants. We say that a non-zero rational function factorizes in M 


if both its numerator and denominator factorize in M. In this case we will say that the rational 
function has all its zeros and poles in M. Similarly, we will say that oe has no poles in M 
if p(s) A 0 and gcd(q(s), 7(s)) = 1 for all ideals (7(s)) € M. And it has no zeros in M if 
gcd(p(s), 7(s)) = 1 for all ideals (7(s)) € M. In other words, it is equivalent that na has 
no poles and no zeros in M and that Ae is a unit of Fy,(s). So, a non-zero rational function 
factorizes in M if and only if it is a unit in Fopecm(E[s])\M(S)- 


Let Fyy(s)’”*™ denote the set of m x m matrices with elements in Fj4(s). A matrix is invertible 
in Fyy(s)”*” if all its elements are in Fjy4(s) and its determinant is a unit in Fyy(s). We denote 
by Gln (Fy(s)) the group of units of Fyy(s)”*”. 


Remark 1. Let M,, Mz C Specm(F{s]). Notice that 
1. If My C Mp then Fy, (s) D Fy, (s) and Glu (Fy, (s)) > Glin (Fy, (s)). 
2s Fy,um, (s) = Fy, (s) NM Fm, (s) and Glin (Fu,um, (s)) — Glin (Fm, (s)) M Glin (Fm, (s)). 
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For any M C Specm(F{s]) the ring Fyy(s) is a principal ideal domain (see [3]) and its field 
of fractions is F(s). Two matrices T;(s),T>(s) € F(s)”*™ are equivalent with respect to M if 
there exist matrices U(s), V(s) € Glin(IFyy(s)) such that T>(s) = U(s)T,(s)V(s). Since Fyy4(s) is 
a principal ideal domain, for all non-singular G(s) € Fys(s)”*” (see [13]) there exist matrices 
U(s),V(s) € Gln(Fm(s)) such that 


G(s) = U(s) Diag(a1(s),...,am(s))V(s) (9) 


with a1(s) | --- | a&m(s) (“|” stands for divisibility) monic polynomials factorizing in M, 
unique up to multiplication by units of Fjy,(s). The diagonal matrix is the Smith normal form 
of G(s) with respect to M and @;(s),...,&m(s) are called the invariant factors of G(s) with 
respect to M. Now we introduce the Smith-McMillan form with respect to M. Assume that 


T(s) € F(s)*™ is a non-singular rational matrix. Then T(s) = Se with G(s) € Fyy(s)"*™ 


and d(s) € F{s] monic, factorizing in M. Let G(s) = U(s) Diag(ay(s),...,a&m(s))V(s) be the 
Smith normal form with respect to M of G(s), ie., U(s),V(s) invertible in Fyy(s)"*™" and 


«1(s) | --- | &m(s) monic polynomials factorizing in M. Then 
. €1(s) Em/(S) ) 
T(s) = U(s) Dia (23. Vis (10) 
EES fi(s)°” Pm(s) ) 
where un are irreducible rational functions, which are the result of dividing «;(s) by d(s) 
and canceling the common factors. They satisfy that e1(s) | --- | €m(s), Pm(s) | --- | p1(s) 


are monic polynomials factorizing in M. The diagonal matrix in (10) is the Smith-McMillan 


form with respect to M. The rational functions ot, i = 1,...,m, are called the invariant 
rational functions of T(s) with respect to M and constitute a complete system of invariants of 
the equivalence with respect to M for rational matrices. 


In particular, if M = Specm(F[s]) then Fspecm(e{s])(S) = F[s], the matrices U(s),V(s) € 
Gln (F[s]) are unimodular matrices, (10) is the global Smith-McMillan form of a rational 


matrix (see [15] or [14] when F = R or C) and ae are the global invariant rational functions 
of T(s). 


From now on rational matrices will be assumed to be non-singular unless the opposite is 
specified. Given any M C Specm(F[s]) we say that an m x m non-singular rational matrix has 
no zeros and no poles in M if its global invariant rational functions are units of Fyy,(s). If its 
global invariant rational functions factorize in M, the matrix has its global finite structure 
localized in M and we say that the matrix has all zeros and poles in M. The former 
means that T(s) € Glm(Fjy(s)) and the latter that T(s) € Glin (Fspecm(rjs})\m(s)) because 
det T(s) = det U(s) det V(s) Sus}“<nb) and det U(s),det V(s) are non-zero constants. The 
following result clarifies the relationship between the global finite structure of any rational 
matrix and its local structure with respect to any M C Specm(F{s]). 


Proposition 2. Let M C Specm(F[s]). Let T(s) € F(s)""*™ be non-singular with On Hy pa 


its global invariant rational functions and let AEE Sid | i. be irreducible rational functions such 
that €(s) |---| €m(s), Pm(s) | --- | 1 (s) are monic polynomials factorizing in M. The following 


properties are equivalent: 
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1. There exist T,(s), Tr(s) € F(s)"*™ such that the global invariant rational functions of Ty (s) are 


Be. ame}, Te() © Glm(Fa(s)) and T(s) = Ty(s)Tr(s). 


2. There exist matrices U;(s), U2(s) invertible in Fyy(s)"*™ such that 


1(s) Em(S) ) 
T(s) = Uj(s) Dia aay Un(s), 11 
(s) 1(s) 5(2e... Ym (S) 2(s) ( ) 
ie, als) &m(3) are the invariant rational functions of T(s) with respect to M. 


Yu(s)?"" "7 Pn(s) 
3. aj(s) = e;(s)el(s) and Bi(s) = pi(s)pi(s) with f(s), pi(s) € F[s] units of Fyy(s), fori = 


‘Lp.t gan 


Proof.- 1 = 2. Since the global invariant rational functions of T,(s) are >> % oo, 
there exist W,(s),Wo(s) € Glin(F[s]) such that Tz(s) = W,(s) Diag ae - F oe W2(s). 


1 
AS Fepecm(F{s]) (5) = Fs], by Remark 1.1, Wi(s),W2(s) € Glu(Fm(s)). Therefore, putting 
U;(s) = W,(s) and Up(s) = Wo(s)Tr(s) it follows that U;(s) and Up(s) are invertible in 


Fm(s)"*" and T(s) = Uy(s) Diag ( $4,..., 2°) Up(s). 


2 = 3. There exist unimodular matrices Vj (s), V2(s) € F[s]*™ such that 
. i(s) &m(S) ) 
T(s) = V4(s) Dia 7 Vo(s 12 
(8) = Vals) Diag (FS... FET) vals) (12) 
with rs = irreducible rational functions such that «1(s) | --- | &m(s) and Bm/(s) |---| B1(s) are 
monic polynomials. Write ae ) = mee : such that p;(s),q;(s) factorize in M and p’(s),q/(s) 


s) 
factorize in Specm(F{s]) \ M. Then 


T(s) = V;(s) Diag Gea Ent) Diag (a Ent) ) Vo(s) (13) 


with Vi(s) and Diag (48... ae i) Vo(s) invertible in Fyy(s)"*™. Since the 
pi(s) = €i(s) 


Smith-McMillan form with respect to M is unique we get that TC eo 


3 => 1. Write (12) as 


ye vteDine (28) ml) re Ole), 
713) = WiG)Diag (Foy Fey) Dias (ayer geay) BO 


It follows that T(s) = Tz(s)Tr(s) with T,(s) = V;(s) Diag (2 — a 

Diag (S1S,..., B41) Vols) € Gln(Fau(s)). 

Ronollany 3). Feb Xe )s= BAS) venom singular aid Vin Ma ee Specie) Sele tiaras) 

Mp = ©. if a) — Ein 
1 


are the invariant rational functions of T(s) with respect to M;,i = 1,2, 


) 
) 
Vo) e2 1 
then “)ea(s) aie are the invariant rational functions of T(s) with respect to My U Mp. 
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Proof.- Let Ry 1B, ot) be the global invariant rational functions of T(s). By Proposition 2, 
a;(s) = e}(s)n}(s), Bi(s) = p}(s)d}(s a ni(s),d}(s) € F[s] units of Fy, (s). On the other 
hand a;(s ) = €?(s)n?(s), B ys ) = y?(s)d?(s), with n?(s),d?(s) € F[s] units of Fy,(s). So, 


2 2 1 

e}(s)n}(s) = €?(s)n?(s) or equivalently n}(s) = “i ey 7740) = 4 me = ) The polynomials 
e}(s),€7(s) are coprime because e}(s) factorizes in My, €?(s) factorizes in My and MyM M2 = 
@. In consequence e}(s) | n?(s) and e?(s) | n}(s). Therefore, there exist polynomials a(s), 


Since a;(s) = e}(s)n}(s) = ef(s)e?(s)a'(s) and a;(s) = e?(s)n?(s) = 2(s)e}(s)a(s). This 
implies that a(s) = a’(s) unit of Fy,(s) Fyy,(s) = Fy,um,(s). Following the same ideas 
we can prove that B;(s) = #}(s)?(s)b(s) with b(s) a unit of Fyy,um,(s). By Proposition 2 
e1(s)eq(s) Ein (8)em (8) 
$1 (s)¥r(s)""~"” Puls) ¥n(s) 
Corollary 4. Let M1, Mz C Specm(F[s]). Two non-singular matrices are equivalent with respect to 
My, U Mp if and only if they are equivalent with respect to My and with respect to Mp. 


unit of Fy, (s), and a’(s), unit of Fjy,(s), such that n?7(s) = e}(s)a(s),n}(s) = €?(s)a'(s). 
s 


are the invariant rational functions of T(s) with respect to My; U M>.™ 


Proof.- Notice that by Remark 1.2 two matrices T,(s),To(s) € F(s)”*™ are equivalent with 
respect to Mj U Mp if and only if there exist Uj (s), U2(s) invertible in Fy, (s)”*" N Fy, (s)"*™ 
such that Tp(s) = Uy(s)T)(s)U2(s). Since U;(s) and Up(s) are invertible in both Fy, (s)"*™ 
and Fy, (s)"*™” then T;(s) and T>(s) are equivalent with respect to M, and with respect to 
Mb. 


Conversely, if T,(s) and T>(s) are equivalent with respect to M, and with respect to Mp then, 
by the necessity of this result, they are equivalent with respect to Mm \ (M,N M2), with respect 


to Mp \ (M,N M2) and with respect to M,N Mp. eis) En(s) 


ge)" *HLle (s) be the invariant rational 


functions of T,(s) and T>(s) with respect to M, \ (M7 Mp), ee ae we : be the invariant 
1 
rational functions of T;(s) and T>(s) with respect to My \ (MyM M2) and ae ae ae be 
1 m 


the invariant rational functions of T|(s) and T2(s) with respect to M,; M Mp. By Corollary 3 
et(s) e7(s) €7(s) En(S) €m(S) €m(S) 
Hy (s) wr(s) wy(s)7- °°” PhS) Pals) PACs) 
with respect to M; U Mp. Therefore, T (s) and Tp(s) are equivalent with respect to M, U My. 


must be the invariant rational functions of T;(s) and T>(s) 


Let F pr (8) be the ring of proper rational functions, that is, rational functions with the degree 
of the numerator at most the degree of the denominator. The units in this ring are the 
rational functions whose numerators and denominators have the same degree. They are called 
biproper rational functions. A matrix B(s) € Fp;(s)'"*™ is said to be biproper if it is a unit in 
Fp (s)™*"™ or, what is the same, if its determinant is a biproper rational function. 


Recall that a rational function t(s) has a pole (zero) at oo if t (2) has a pole (zero) at 0. 
Following this idea, we can define the local ring at oo as the set of rational functions, f(s), 
such that t (1) does not have 0 as a pole, that is, Foo(s) = {t(s) € F(s):t (4) € F;(s) }. If 


t(s) = FS with p(s) = ast + apyis'tt +--+ +aysP, ay £0, q(s) = bys! + by yas"tt +--+ + 
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bys, bg £0, p = 4(p(s)), q = d(q(s)), where d(-) stands for “degree of”, then 


a, a, a = —ft— 
r 1\ gtr te ta ash! + ayyish Teta gp F(S) oq—p (15) 
s by Drst 4 4 Pe byt + bp yst 1 +++ + by g(s 
st gftl sf 


Feo(s) = {a3 € F(s) : d(q(s)) > atp(s))}. (16) 


Thus, this set is the ring of proper rational functions, Fy, (s). 


Two rational matrices T,(s),T)(s) € F(s)”™*” are equivalent at infinity if there exist biproper 
matrices B,(s), Bz(s) € Gln (Fpr(s)) such that T2(s) = By(s)T;(s)Bz(s). Given a non-singular 
rational matrix T(s) € F(s)'*” (see [15]) there always exist By (s),Bz(s) € Gln (Fpr(s)) such 
that 

T(s) = By(s) Diag(s™,...,s7”") Bo(s) (17) 


where q, > --- > qm are integers. They are called the invariant orders of T(s) at infinity and 
the rational functions s™,...,s% are called the invariant rational functions of T(s) at infinity. 


3. Structure of the ring of proper rational functions with prescribed finite 
poles 


Let M’ C Specm(EF[s]). Any non-zero rational function t(s) can be uniquely written as t(s) = 
al n'(s) where ne) is an irreducible rational function factorizing in M' and at 
s) d'(s) d(s) d'(s) 


Fyr(s). Define the following function over F(s) \ {0} (see [15], [16]): 


is a unit of 


5: F(s)\ {0} > Z 
t(s) ++ d(d'(s)) —d(n'(s)). 


This mapping is not a discrete valuation of FF(s) if M’ 4 ©: Given two non-zero elements 
t1(s),t2(s) € F(s) it is clear that 5(t1(s)to(s)) = 6(t1(s)) + 6(to(s)); but it may not satisfy that 
5(t1(s) + to(s)) > min(d(t (s)),5(t2(s))). For example, let M’ = {(s— a) € Specm(R[s]) : a ¢ 
[—2, —1]}. Put ty(s) = aot and to(s) = sth. We have that 5(t)(s)) = d(s +1.5) —d(1) =1, 


5(t2(s)) = d(s + 1.5) — d(1) = 1 but 6(t)(s) + t(s)) = 4(2) =0. 


(18) 


is] 


However, if M’ = @ and t(s) = ae € F(s) where n(s),d(s) € F[s], d(s) 4 0, the map 


deo : F(s) + ZU {+o} (19) 


defined via Jo. (t(s)) = d(d(s)) — d(n(s)) if t(s) A 0 and 6.(t(s)) = +oe0if t(s) = Ois a discrete 
valuation of F(s). 


Consider the subset of F(s), Fyy(s) 9 Fp;(s), consisting of all proper rational functions 
with poles in Specm(F[s]) \ M’, that is, the elements of Fy (s) M Fpr(s) are proper rational 
functions whose denominators are coprime with all the polynomials 7(s) such that (7t(s)) € 


M’. Notice that g(s) € Fy (s) M Fpr(s) if and only if g(s) = n(s) ae where: 
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(a) n(s) € F[s] is a polynomial factorizing in M’, 


(b) ae is an irreducible rational function and a unit of Fyy(s), 
(c) 6(g(s)) — d(n(s)) > 0 or equivalently dco(g(s)) > 0. 


me) € Fpr(s). The units in Fy (s) M Fpr(s) are biproper rational 


a i, (n'(s)) = d(d'(s)), with n'(s),d'(s) factorizing in Specm(F[s]) \ M’. 
Furthermore, Fy (s) M Fyr(s) is an integral domain whose field of fractions is F(s) provided 


that M’ 4 Specm(F{s]) (see, for example, [15, Prop.5.22]). Notice that for M’ = Specm(EF[s]), 
Fy (s) M1 Fpr(s) = F[s] M Fp;(s) = F. 


In ae (c)i 


Assume that there are ideals in Specm(F[s]) \ M' generated by linear polynomials and let (s — 
a) be any of them. The elements of Fyy(s) M Fpr(s) can be written as g(s) = n(s)u(s) gay 
where n(s) € F[s] factorizes in M’, u(s) is a unit in Fy (s) OV Fpr(s) and d = 5(g(s)) > d(n(s)). 
If F is algebraically closed, for example F = C, and M’ 4 Specm(F[s]) the previous condition 
is always fulfilled. 


The divisibility in Fyy (s) MN Fpr(s) is characterized in the following lemma. 


Lemma 5. Let M' C Specm(F{s]). Let g1(s),2(s) € Fy(s) M1 Fpr(s) be such that g1(s) = 

ny (s) HH and g9(s) = no(s) we with n(s),n2(s) € Fs] factorizing in M’ and * mts 
1 dy 2 

irreducible rational functions, units of Fyy(s). Then gi(s) divides go(s) in Fyy(s)  Fpr(s) if and 


only if 


n1(s) | n2(s) in F{s] (20) 
6(g1(s)) — d(m1(s)) < 6(g2(s)) — d(np(s)). (21) 


Proof.- If g1(s) | go(s) then there exists g(s) = n(s) 5, 5 € Fy(s) N Fpr(s), with n(s) € F[s] 
factorizing in M’ and n'(s),d'(s) € Fis] coprime, factorizing in Specm(F[s]) \ M’, such 
that go(s) = g(s)g1(s). Equivalently, n2(s )F ua =n s)5 3 ny (s 3 Ho = n(s )ny(s) ae ey). 
So 2(s) = n(s)mj(s) and 4(go(s)) — d(ng(s)) = 6(g(s)) — d(n ns) + 6(g1(s)) — a(m(s 5). 


Moreover, as g(s) is a proper rational function, 5(g(s)) — d(n(s)) > O and 6d(go(s)) — 
d(nz(s)) > 5(gi(s)) — d(m1(s)). 


Conversely, if m;(s) | 2(s) then there is n(s) € F[s], factorizing in M’, such that n2(s) = 


n(s)n1(s). Write g(s) = ns) where ms is an irreducible fraction representation of 
ee, ie., a } = pene 3 after canceling possible common factors. Thus ma) = ne ae 
and 
5(g(s)) — d(n(s)) = d(a"(s)) — d(n'(s)) — d(n(s)) 
= d(dy(s)) + (4 (s)) — d(1p(s)) — d(dy(s)) — d(na(s)) +d(mi(s)) (22) 
= 6(g2(s)) — d(na(s)) — (6(g1(s)) — d(mi(s))) > 0 


Then g(s) € Fyy(s) M Fpr(s) and go(s) = g(s)91(s). a 
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Notice that condition (20) means that ¢;(s) | go(s) in Fyy(s) and condition (21) means that 

81(s) | 82(s) in Fpr(s). So, gi(s) | g2(s) in Fyr(s) 1 Fpr(s) if and only if gi(s) | ga(s) 

simultaneously in Fy (s) and Fp; (s). 

Lemma 6. Let M' C Specm(F[s]). Let gi(s),g2(s) € Fear(s) Fpr(s) be such that g1(s) = 

my(s) SHS and g9(s) = np(s) wie as in Lemma 5. If n1(s) and n(s) are coprime in Fs] and either 
1 2 


5(g1(s)) = d(m1(s)) or 5(g2(s)) = d(nz(s)) then g1(s) and go(s) are coprime in Fy (s) A Fpr(s). 


Proof.- Suppose that ¢1(s) and go(s) are not coprime. Then there exists a non-unit g(s) = 
n(s) ae € Fyr(s) M Fpr(s) such that g(s) | g1(s) and g(s) | go(s). As g(s) is not a unit, n(s) 
is not a constant or 6(g(s)) > 0. If n(s) is not a constant then n(s) | 11(s) and n(s) | 12(s) 
which is impossible because 11(s) and n(s) are coprime. Otherwise, if n(s) is a constant then 


5(g(s)) > Oand we have that 6(¢(s)) < 5(g1(s)) —d(m(s)) and 6(g(s)) < 5(go(s)) — d(no(s)). 
But this is again impossible. a 


It follows from this Lemma that if gj (s), g2(s) are coprime in both rings Fy (s) and Fp;(s) then 
g1(s), 82(s) are coprime in Fyy(s) M Fyr(s). The following example shows that the converse is 
not true in general. 


Example 7. eee that F = Rand M’ = Specm(R{s]) \ {(s* + 1)}. It is not difficult to 
prove that g1(s) = fy and go(s) = iq are coprime elements in Ry (s) M Rpr(s). Assume 
that there exists a non-unit g(s) = n(s Vie € Ry(s) A Rpr(s) such that g(s) | gi(s) and 
g(s) | g2(s). Then n(s) | s*, n(s) | s and 5(g(s)) — d(n(s)) = 0. Since g(s) is not a unit, n(s) 
cannot be a constant. Hence, n(s) = cs, c # 0, and 5(g(s)) = 1, but this is impossible because 


d'(s) and n'(s) are powers of s? +1. Therefore g1(s) and g2(s) must be coprime. However 
n1(s) = s? and p(s) = s are not coprime. 


Now, we have the following property when there are ideals in Specm(F{s]) \ M’, M’ C 
Specm(F[s]), generated by linear polynomials. 


Lemma 8. Let M’ C Specm(FF[s]). Assume that there are ideals in Specm(F[s]) \ M’ generated 
by linear polynomials and let (s — a) be any of them. Let ails S),2(s) € Fyr(s) NM Fpr(s) be such 


that g\(s) = ny(s)uyz(s ae : a, and go(s) = nz(s)ua(s eae If g1(s) and go(s) are coprime in 


s—a) s—a)®2 * 


Fy (s) VF p(s) then ny (s) and nz(s) are coprime in F[s] and either d, = d(n,(s)) or dz = d(np(s)). 


Proof.- Suppose that n(s) and n2(s) are not coprime in F{s]. Then there exists a non-constant 
n(s) € F[s] such that n(s) | m1(s) and n(s) | no(s). Let d = d(n(s)). Then g(s) = n(s) Gay 
is not a unit in Fyy(s) M Fy,(s) and divides g;(s) and g2(s) because 0 = d —d(n(s)) < dy — 
d(ny(s)) and 0 = d—d(n(s)) < dy — d(no(s)). This is impossible, so nj(s) and n2(s) must be 
coprime. 


Now suppose that dj > d(m1(s)) and dz > d(np(s)). Let d = min{d, — d(n1(s)),do — 
d(nz(s))}. We have that d > 0. Thus g(s) = = is not a unit in Fy (s) M Fpyr(s) and divides 
gi(s) and go(s) because d < d, —d(n,(s)) and d < dy —d(np(s)). This is again impossible 
and either d; = d(n1(s)) or dz = d(no(s)). a 


56 Linear Algebra — Theorems and Applications 


The above lemmas yield a characterization of coprimeness of elements in Fyy/(s) M Fpr(s) 
when M’ excludes at least one ideal generated by a linear polynomial. 


Following the same steps as in [16, p. 11] and [15, p. 271] we get the following result. 


Lemma 9. Let M’ C Specm(F[s]) and assume that there is at least an ideal in Specm(F[s]) \ M’ 
generated by a linear polynomial. Then F yy (s) MN Fpr(s) is a Euclidean domain. 


The following examples show that if all ideals generated by polynomials of degree one are in 
M’, the ring Fyy(s)N Fpr(s) may not be a Bezout domain. Thus, it may not be a Euclidean 
domain. Even more, it may not be a greatest common divisor domain. 


Example 10. Let F = Rand M’ = Specm(R{s]) \ {(s? + 1)}. Let g1(s) = £7) 82(s) = 

Rw (s) 1 Rpr(s). We have seen, in the previous example, that g1(s),g2(s) are coprime. We 
show now that the Bezout identity is not fulfilled, that is, there are not a(s),b(s) € Ryy(s)N 
Rpr(s) such that a(s)g1(s) + b(s)go(s) = u(s), with u(s) a unit in Ryy(s) M Rp-(s). Elements 
in Ry (s) M Rpr(s) are of the form aaa 
d(n(s)) and the units in Ry (s) M Rp (s) are non-zero constants. We will see that there are not 
elements a(s) = hy, b(s) = aie with n(s) and n’(s) coprime with s* + 1, 2d > d(n(s)) 
and 2d' > d(n'(s)) such that a(s)g1(s) + b(s)g0(s) = c, with c non-zero constant. Assume that 


ea fy ea siqq = ©. We conclude that c(s? + 1)4*1 or c(s? +. 1)“'+ is a multiple of 


s, which is impossible. 


with n(s) relatively prime with s? +1 and 2d > 


Example 11. Let F = Rand M’ = Specm(R{s]) \ {(s* + 1)}. A fraction g(s) = aie € 


) 
Rywr(s) O Rpr(s) if and only if 2d —d(n(s)) > 0. Let gi(s) = were 82(s) = a, 


Ry (s) OM Rpr(s). By Lemma 5: 
© 9(s) | 91(s) = n(s) | s? and 0 < 2d—d(n(s)) <6-2=4 
© 9(s) | go(s) = n(s) | s(s +1) and 0 < 2d —d(n(s)) <8-2=6. 


If n(s) | s* and n(s) | s(s +1) then n(s) = c or n(s) = cs with c a non-zero constant. Then 
g(s) | g1(s) and g(s) | go(s) if and only if n(s) = c andd < 2 or n(s) = cs and 2d < 5. So, the 
list of common divisors of g1(s) and go(s) is: 


C c cs cs 
, , , : F, Ores 23 
{e s?+1’ (s2+1)2’ 5241’ (s2 +1)? . . \ 2 


If there would be a greatest common divisor, say a, then n(s) = cs because n(s) must be 


a multiple of c and cs. Thus such a greatest common divisor should be either 


cs cs 
1 OF EHP! 


but CHE does not divide neither of them because 


4=6 (2 + 7 d(c) > max {2 (5 i) —d(cs),6 (~ —) ates) | =3. (24) 


Thus, ¢1(s) and g9(s) do not have greatest common divisor. 
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3.1. Smith-McMillan form 


A matrix U(s) is invertible in Fy (s)"*" M Fp; (s)”*™ if U(s) © Fyy (s)"*" 9 Fyr(s)"*™ and 
its determinant is a unit in both rings, Fyy(s) and Fp,(s), ie., U(s) € Glin (s) M Fpr(s)) if 
and only if U(s) € Glin (Eu (s)) OQ Glin (Fpr(s)). 


Two matrices G;(s),G2(s) € Fyy(s)"*" M1 Fpr(s)"*™ are equivalent in Fyy(s) M Fpr(s) if 
there exist U;(s), Uz(s) invertible in Fy (s)"*" A Fy,(s)*™ such that 


Go(s) = U;(s)G,(s)Up(s). (25) 


If there are ideals in Specm(F{s]) \ M’ generated by linear polynomials then F yy (s)  Fpr(s) 
is an Euclidean ring and any matrix with elements in Fyy(s) 9 Fpr(s) admits a Smith normal 
form (see [13], [15] or [16]). Bearing in mind the characterization of divisibility in Fyy(s) N 
Fpr(s) given in Lemma 5 we have 


Theorem 12. (Smith normal form in Fyy(s) M1 Fpr(s)) Let M’ C Specm(F\s]). Assume that 
there are ideals in Specm(F{s]) \ M’ generated by linear polynomials and let (s — a) be one of them. 
Let G(s) € Fyy(s)"*" OM Fyr(s)"*™ be non-singular. Then there exist Uy(s),U2(s) invertible in 
Fy (s)"*" 1 Fpr(s)"*™ such that 


; 1 1 
G(s) = U, (s) Diag (m (s) (s—a)h’ -..,Mm \——7r) Up(s) (26) 
with n1(s)|---|Mm(s) monic polynomials factorizing in M' and d,...,dm integers such that 0 < 
dy —d(ny(s)) <-+-+ <dm—d(nm(s)). 


Under the hypothesis of the last theorem 14(8) Soaps Mm(8) i 


tay 
system of invariants for the equivalence in Fy (s) M Fpr(s) and are called the invariant rational 
functions of G(s) in Fyy(s) MFpr(s). Notice that 0 < dj <--- < dm because nj(s) divides 
ni+1(s). 


Recall that the field of fractions of Fy (s)  Fpr(s) is F(s) when M’ 4 Specm(FF[s]). Thus we 
can talk about equivalence of matrix rational functions. Two rational matrices T;(s),T>(s) € 
F(s)"*” are equivalent in Fy (s) M Fp,(s) if there are Uy (s), Uz(s) invertible in Fyy(s)"*" 9 
Fpr(s)"*™ such that 


form a complete 


Tp(s) = Uy (s)T; (s)Up(s). (27) 
When all ideals generated by linear polynomials are not in M’, each rational matrix admits a 
reduction to Smith-McMillan form with respect to Fyy(s) M Fpr(s). 


Theorem 13. (Smith-McMillan form in Fy (s) 1 Fpr(s)) Let M’ C Specm(F{s]). Assume that 
there are ideals in Specm(F{s]) \ M’ generated by linear polynomials and let (s — a) be any of 
them. Let T(s) € F(s)"*™ be a non-singular matrix. Then there exist U(s),Up(s) invertible in 
Fy (s)"*" 0 Epr(s)"*™ such that 


€1(s) €m(S) 
‘ —a)y" s—q)tm 
nes 1D (© = Pere - ) a (28) 
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with SW) Fy (s) A Fpr(s) coprime for all i such that €;(s), p;(s) are monic polynomials 


(s—a)"i y (s—a)%i 
(s) Pils) Yi-1(s) 
: (s—a)4 ea) for 


oe . / Ej €, 
factorizing in M', ea) 


a divides ce fori = 1,...,m—1 while - divides 


i=2,...,m. 


ej(s) 
(—a)"i 
PA) 
(s-a)%i 
constitute a complete system of invariant for the equivalence in Fyy(s) M Fpr(s) of rational 
matrices. However, this system of invariants is not minimal. A smaller one can be obtained 


by substituting each pair of positive integers (n;,d;) by its difference ]; = nj — dj. 


The elements of the diagonal matrix, satisfying the conditions of the previous theorem, 


Theorem 14. Under the conditions of Theorem 13, en cat with e;(s), p(s) monic and coprime 


polynomials factorizing in M’, e;(s) | €:41(s) while p;(s) | pj—1(s) and ,...,lm integers such that 
1, + d(q1(s)) — d(eq(s)) < +++ < In +4(m(s)) — d(€m(s)) also constitute a complete system of 
invariants for the equivalence in Fy (s) M Fpr(s). 


ei(s) 1 


Proof.- We only have to show that from the system Bis) (xa? i 


= 1,...,m, satisfying the 
e;(s) 


(s—a)i 
¥i(s) 


(s—a)4i 


conditions of Theorem 14, the system ,i = 1,...,n, can be constructed satisfying the 


conditions of Theorem 13. 


Suppose that e€;(s), #;(s) are monic and coprime polynomials factorizing in M’ such that 
€;(s) | €j41(s) and ;(s) | pj-1(s). And suppose also that l,,...,l are integers such that 
I, + d(ipr(s)) — d(er(s)) < +++ < Im +d(m(s)) — d(Em(s)). If 1; + d(Pi(s)) — d(ei(s)) < 0 for 
all i, we define non-negative integers n; = d(e;(s)) and d; = d(e;(s)) — 1; fori = 1,...,m. If 
1; + d(;(s)) — d(e;(s)) > 0 for all i, we define n; = 1; + d(y;(s)) and d; = d(;(s)). Otherwise 
there is an index k € {2,...,m} such that 


Ik_1 + d(-1(8)) — d(€x_1(8)) < 0 < Te + d(H (s)) — d(ex(s)). (29) 


Define now the non-negative integers n;,d; as follows: 


__ f d(ei(s)) ifi<k , _ fd(e(s))—lifi<k 

o— eee ifi>k “7 Lae) ifi>k Go 

Notice that J; = nj — d;. Moreover, 

0 ifi<k 
ri e(S)) = 17 4 a(y(a)) ~dle(a)) #1 ~ 
d; — d(wp;(s)) ae ce i(s)) + d(e;(s)) a (32) 
and using (29), (30) 

ny — d(e1(s)) = +++ = mp_y — d(ex_1(s)) = 0 < mp — d(ex(S)) S +++ Sm —d(Em(s)) (3) 


dy —d(wpi(s)) 2 +++ 2 de_1 — d(p_1(s)) 2 0 = de — d(x (s)) = +++ = dm —d(Wm(s)). GA) 
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In any case a and oS, are elements of Fyy(s)  Fy;(s). Now, on the one hand 


€;(s), p;(s) are coprime and n; — d(e;(s)) = 0 or d; — d(;(s)) = 0. This means (Lemma 6) that 
ci(s)_ _Vil5) are coprime for all i. On the other hand e;(s) | €;41(s) and 0 < n; — d(e;(s)) < 
( 


(s—a)"i y (s—a)%i 
€i(s) 


ni+1 — d(€j41(s)). Then (Lemma 5) (sa) divides 


€i41(s) 
(s—a)"41 * 


and 0 < d; — d(y;(s)) < dj_1 — d(y;_1(s)), it follows that “©, divides ¥=-15)_ a 


(s—a)%i (s—a)4i-1* 


Similarly, since p;(s) | pi-1(s) 


We call oa Gare i=1,...,m, the invariant rational functions of T(s) in Fyy(s) 1 Fpr(s). 
There is a particular case worth considering: If M’ = © then Fg(s) 9 Fpr(s) = Fpr(s) and 
(s) € Specm(F{s]) \ M’ = Specm(FF[s]). In this case, we obtain the invariant rational functions 
of T(s) at infinity (recall (17)). 


4. Wiener—Hopf equivalence 


The left Wiener-Hopf equivalence of rational matrices with respect to a closed contour in the 
complex plane has been extensively studied ([6] or [10]). Now we present the generalization 
to arbitrary fields ([4]). 


Definition 15. Let M and M' be subsets of Specm([F[s]) such that MU M' = Specm(F{s]). Let 
T,(s), To(s) € F(s)™*™ be two non-singular rational matrices with no zeros and no poles in MM M’. 
The matrices T;(s),T2(s) are said to be left Wiener—Hopf equivalent with respect to (M, M') if there 
exist both Uy (s) invertible in Fy (s)"*™ 1 Fpr(s)"*™ and Up(s) invertible in Fyy(s)"*™ such that 


Ta(s) = Un(s)Ti (s)Ud(s). (35) 


This is, in fact, an equivalence relation as it is easily seen. It would be an equivalence relation 
even if no condition about the union and intersection of M and M’ were imposed. It will 
be seen later on that these conditions are natural assumptions for the existence of unique 
diagonal representatives in each class. 


The right Wiener-Hopf equivalence with respect to (M, M’) is defined in a similar manner: 
There are invertible matrices Uj (s) in Fyy(s)"*™" A Fpr(s)”*™ and Up(s) in Fy(s)”*™ such 
that 

Ty(s) = Up(s)Ty(s)Uh (s). (36) 
In the following only the left Wiener-Hopf equivalence will be considered, but, by 
transposition, all results hold for the right Wiener—-Hopf equivalence as well. 


The aim of this section is to obtain a complete system of invariants for the Wiener-Hopf 
equivalence with respect to (M, M’) of rational matrices, and to obtain, if possible, a canonical 
form. 


There is a particular case that is worth-considering: If M = Specm(F[s]) and M' = @, the 
invertible matrices in Fg(s)”*" M Fy,(s)"*™ are the biproper matrices and the invertible 
matrices in Fspecm(F\[s]) (8) *™ are the unimodular matrices. In this case, the left Wiener-Hopf 
equivalence with respect to (M, M’) = (Specm(EF[s]),@) is the so-called left Wiener-Hopf 
equivalence at infinity (see [9]). It is known that any non-singular rational matrix is left 
Wiener—Hopf equivalent at infinity to a diagonal matrix Diag(s*',...,s%") where 91,...,2m 
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are integers, that is, for any non-singular T(s) € F(s)’*” there exist both a biproper matrix 
B(s) € Gln (Fpr(s)) and a unimodular matrix U(s) € Gln (F[s]) such that 


T(s) = B(s) Diag(s*',...,s8")U(s) (37) 


where g1 > --- > 8m are integers uniquely determined by T(s). They are called the left 
Wiener—Hopf factorization indices at infinity and form a complete system of invariants for 
the left Wiener-Hopf equivalence at infinity. These are the basic objects that will produce the 
complete system of invariants for the left Wiener-Hopf equivalence with respect to (M, M’). 


For polynomial matrices, their left Wiener-Hopf factorization indices at infinity are the 
column degrees of any right equivalent (by a unimodular matrix) column proper matrix. 
Namely, a polynomial matrix is column proper if it can be written as P, Diag(s*!,...,s8") + 
L(s) with P, € F”*™ non-singular, g1,...,%m non-negative integers and L(s) a polynomial 
matrix such that the degree of the ith column of L(s) smaller than g;,1 <i < m. Let P(s) € 
F{s]"*”™ be non-singular polynomial. There exists a unimodular matrix V(s) € F[{s]"*™ such 
that P(s)V(s) is column proper. The column degrees of P(s)V(s) are uniquely determined 
by P(s), although V(s) is not (see [9], [12, p. 388], [17]). Since P(s)V(s) is column proper, it 
can be written as P(s)V(s) = P-D(s) + L(s) with P, non-singular, D(s) = Diag(s%',...,s8") 
and the degree of the ith column of L(s) smaller than g;, 1 < i < m. Then P(s)V(s) = 
(P; + L(s)D(s)~!)D(s). Put B(s) = P; + L(s)D(s)~. Since P, is non-singular and L(s)D(s)~! 
is a strictly proper matrix, B(s) is biproper, and P(s) = B(s)D(s)U(s) where U(s) = V(s)~!. 


The left Wiener—Hopf factorization indices at infinity can be used to associate a sequence of 
integers with every non-singular rational matrix and every M C Specm(F[s]). This is done 
as follows: If T(s) € F(s)”*™ then it can always be written as T(s) = Ty(s)Tr(s) such that 
the global invariant rational functions of T,(s) factorize in M and Tr(s) € Glin(Fu(s)) or, 
equivalently, the global invariant rational functions of Tr(s) factorize in Specm(F[s]) \ M 
(see Proposition 2). There may be many factorizations of this type, but it turns out (see [1, 
Proposition 3.2] for the polynomial case) that the left factors in all of them are right equivalent. 
This means that if T(s) = Tz1(s)Tri(s) = Tr2(s)Tr2(s) with the global invariant rational 
functions of Ty1(s) and Tz2(s) factorizing in M and the global invariant rational functions of 
Tri(s) and Tro(s) factorizing in Specm(F[s]) \ M then there is a unimodular matrix U(s) such 
that T,1(s) = Ty2(s)U(s). In particular, T,;(s) and Ty2(s) have the same left Wiener-Hopf 
factorization indices at infinity. Thus the following definition makes sense: 


Definition 16. Let T(s) € F(s)”*™ be a non-singular rational matrix and M C Specm(F{s]). Let 
TL(s), Tr(s) € F(s)”*"™ such that 

i) T(s) = T(s)Tr(s), 

ii) the global invariant rational functions of T,(s) factorize in M, and 

iii) the global invariant rational functions of Tr(s) factorize in Specm(F{s]) \ M. 


Then the left Wiener-Hopf factorization indices of T(s) with respect to M are defined to be the left 
Wiener—Hopf factorization indices of T,,(s) at infinity. 


In the particular case that M = Specm(F[s]), we can put T,(s) = T(s) and Tr(s) = In. 
Therefore, the left Wiener—Hopf factorization indices of T(s) with respect to Specm(F[s]) are 
the left Wiener-Hopf factorization indices of T(s) at infinity. 
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We prove now that the left Wiener-Hopf equivalence with respect to (M,M’) can be 
characterized through the left Wiener—Hopf factorization indices with respect to M. 


Theorem 17. Let M, M’ C Specm(EF[s]) be such that MU M' = Specm(F[s]). Let T;(s), To(s) € 
F(s)”*"™ be two non-singular rational matrices with no zeros and no poles in MM M"'. The matrices 
T,(s) and T>(s) are left Wiener—Hopf equivalent with respect to (M,M’) if and only if T,(s) and 
Tp(s) have the same left Wiener—Hopf factorization indices with respect to M. 


Proof.- By Proposition 2 we can write T;(s) = Tz1(s)Tri(s), To(s) = Tr2(s)Tr2(s) with the 
global invariant rational functions of T,1(s) and of Tz2(s) factorizing in M \ M’ (recall that 
Ti(s) and T>(s) have no zeros and no poles in MM M’) and the global invariant rational 
functions of Ta; (s) and of Tr9(s) factorizing in M’ \ M. 


Assume that T,(s), T2(s) have the same left Wiener-Hopf factorization indices with respect 
to M. By definition, T;(s) and Tp(s) have the same left Wiener-Hopf factorization indices 
with respect to M if Tr1(s) and Tr2(s) have the same left Wiener-Hopf factorization indices at 
infinity. This means that there exist matrices B(s) € Gly (Fpr(s)) and U(s) € Gln (F[s]) such 
that Tr2(s) = B(s)T;1(s)U(s). We have that To(s) = Tr2(s)Tro(s) = B(s)Tz1(s)U(s)TrRo(s) = 
B(s)T1(s)(Tri(s) !U(s)Tro(s)). We aim to prove that B(s) = Tz2(s)U(s)~'Ti1(s)~! is 
invertible in Fyy(s)”*™” and Tr1(s)~!U(s)Tro(s) € Gln(Fy(s)). Since the global invariant 
rational functions of Tz2(s) and Tz;(s) factorize in M \ M’, Tr2(s),Tri(s) € Fyp(s)™"*™ 
and B(s) € Fyy(s)”*”. Moreover, det B(s) is a unit in Fyy(s)”*™ as desired. Now, 
Tri(s)~!U(s)Tr2(s) € Gln(IFyy(s)) because Tr (s), Tr2(s) € Fy(s)"*™ and det Ta1(s) and 
det Tr (s) factorize in M’ \ M. Therefore T,(s) and T>(s) are left Wiener-Hopf equivalent with 
respect to (M, M’). 


Conversely, let Uy(s) © Glin(Fiw(s)) A Gla (Fpr(s)) and Up(s) € Gln(Fu(s)) such that 
T,(s) = Uj(s)T2(s)Up(s). Hence, T}(s) = Tri(s)Tri(s) = Ui(s)Tr2(s)Tro(s)U2(s). Put 
Tr2(s) = Uy(s)Tz2(s) and Tro(s) = Tro(s)Up(s). Therefore, 


(i) T(s) = Tr1(s)Tri(s) = Tr2(s)TRo(s), 
(ii) the global invariant rational functions of T;1(s) and of Tz2(s) factorize in M, and 
(iii)the global invariant rational functions of Tr; (s) and of T2(s) factorize in Specm(IF[s]) \ M. 


Then Tzi(s) and Ty2(s) are right equivalent (see the remark previous to Definition 16). 
So, there exists U(s) € Gln(F[s]) such that Ty1(s) = Tro(s)U(s). Thus, Ty1(s) = 
U;(s)Tz2(s)U(s). Since U;(s) is biproper and U(s) is unimodular T,;(s), Tr2(s) have the 
same left Wiener—Hopf factorization indices at infinity. Consequentially, T|(s) and Tp(s) have 
the same left Wiener—Hopf factorization indices with respect to M. | 


In conclusion, for non-singular rational matrices with no zeros and no poles in MN M’ the left 
Wiener—Hopf factorization indices with respect to M form a complete system of invariants for 
the left Wiener-Hopf equivalence with respect to (M, M’) with MU M’ = Specm(F[s]). 

A straightforward consequence of the above theorem is the following Corollary 

Corollary 18. Let M,M' C Specm(EF[s]) be such that MUM! = Specm(F{s]). Let T;(s), 
To(s) € F(s)"*™ be non-singular with no zeros and no poles in MM M'. Then T\(s) and 
T2(s) are left Wiener—Hopf equivalent with respect to (M, M’) if and only if for any factorizations 
T,(s) = Tr1(s)Tr1(s) and T2(s) = Tr2(s)Tro(s) satisfying the conditions (i)—(iii) of Definition 16, 
Tr1(s) and Tr2(s) are left Wiener—Hopf equivalent at infinity. 
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Next we deal with the problem of factorizing or reducing a rational matrix to diagonal form 
by Wiener—Hopf equivalence. It will be shown that if there exists in M an ideal generated by 
a monic irreducible polynomial of degree equal to 1 which is not in M’, then any non-singular 
rational matrix, with no zeros and no poles in MM M’ admits a factorization with respect to 
(M, M ). Afterwards, some examples will be given in which these conditions on M and M 
are removed and factorization fails to exist. 


Theorem 19. Let M, M' C Specm(F{s]) be such that MU M' = Specm(E[s]). Assume that there 
are ideals in M \ M’ generated by linear polynomials. Let (s — a) be any of them and T(s) € F(s)"*™ 
a non-singular matrix with no zeros and no poles in MM M'. There exist both U;(s) invertible in 
Fy (s)"*" 1 Fpr(s)™*™ and Up(s) invertible in Fyy(s)"*" such that 


T(s) = Uy(s) Diag((s — a)"',...,(s — a)*")Up(s), (38) 


where kj > --- > km are integers uniquely determined by T(s). Moreover, they are the left 
Wiener—Hopf factorization indices of T(s) with respect to M. 


Proof.- The matrix T(s) can be written (see Proposition 2) as T(s) = Tz,(s)Tr(s) with the 
global invariant rational functions of T,(s) factorizing in M \ M’ and the global invariant 
rational functions of Tr(s) factorizing in Specm(F[s]) \ M = M’' \ M. Asky,...,km are the left 
Wiener—Hopf factorization indices of T;,(s) at infinity, there exist matrices U(s) € Glin (F[s]) 
and B(s) € Glm(Fpr(s)) such that T,(s) = B(s)D1(s)U(s) with D,(s) = Diag(s“,...,sk). 
Put D(s) = Diag((s—a)",...,(s — a)k) and U;(s) = B(s) Diag (oe tn) . Then 
Tr(s) = Uy(s)D(s)U(s). If Up(s) = U(s)TR(s) then this matrix is invertible in Fyy(s)"*™ 
and T(s) = Uj(s) Diag((s — a)",...,(s — a)*")Ub(s). We only have to prove that LU;(s) is 
invertible in Fyy(s)"*" M Fpr(s)"*™. It is clear that Uj(s) is in Fpr(s)”*™ and biproper. 
Moreover, the global invariant rational functions of T,(s) Uj(s) = Tz(s)(D(s)U(s))~! 
factorize in M \ M’. Therefore, U;(s) is invertible in Fyy (s)"*™. 


We prove now the uniqueness of the factorization. Assume that T(s) also factorizes as 
T(s) = Uy(s) Diag((s —a)",...,(s — a)*")th(s), (39) 


with k, >--- > kn integers. Then, 


Diag((s — a)",...,(s —a)*") = Ch (s)~!Uj(s) Diag((s — a)"',..., (s — a)‘ ) Uy (s)Up(s)~! 


The diagonal matrices have no zeros and no poles in MM M’ (because (s — a) € M \ M’) 
and they are left Wiener-Hopf equivalent with respect to (M, M’). By Theorem 17, they have 


the same left Wiener—Hopf factorization indices with respect to M. Thus, k; = k; for alli = 
1,...,m. | 


Following [6] we could call left Wiener—Hopf factorization indices with respect to (M, M’) the 
exponents k; > --- > ky» appearing in the diagonal matrix of Theorem 19. They are, actually, 
the left Wiener-Hopf factorization indices with respect to M. 


Several examples follow that exhibit some remarkable features about the results that have 
been proved so far. The first two examples show that if no assumption is made on the 
intersection and/or union of M and M’ then existence and/or uniqueness of diagonal 
factorization may fail to exist. 
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Example 20. If P(s) is a polynomial matrix with zeros in MM M’ then the existence of 
invertible matrices Uy(s) € Gln(Fyw(s)) A Gln (Fpr(s)) and Up(s) € Glnn(Fy(s)) such that 
P(s) = U,(s) Diag((s — a)",...,(s — a)*")Up(s) with (s— a) € M\ M! may fail. In fact, 
suppose that M = {(s),(s +1)}, M’ = Specm F[s] \ {(s)}. Therefore, MM M’ = {(s+1)} 
and (s) € M\ M’. Consider p;(s) = s +1. Assume that s +1 = u1(s)s*u2(s) with u1(s) a 
unit in Fyy(s) NFp;(s) and u2(s) a unit in Fyy(s). Thus, u1(s) = c a nonzero constant and 
ug(s) = ¢° which is not a unit in Fyy(s). 


Example 21. If MUM! 4 Specm F{s] then the factorization indices with respect to (M, M’) 
may be not unique. Suppose that (B(s)) ¢ MU M’, (7(s)) € M\ M! with d(7(s)) = 1 and 
p(s) = u4(s)7(s)*uo(s), with uj (s) a unit in Fry (s) M Fpr(s) and u2(s) a unit in Fy4(s). Then 


p(s) can also be factorized as p(s) = ti;(s)7(s)*-4(P)) fin (s) with a (s) = uy (s) ee 


in Fy (s) M Fp;(s) and ti2(s) = B(s)uz(s) a unit in Fyy(s). 


a unit 


The following example shows that if all ideals generated by polynomials of degree equal to 
one are in M’ \ M then a factorization as in Theorem 19 may not exist. 


Example 22. Suppose that F = IR. Consider M = {(s?+1)} C Specm(R[s]) and M’ = 
Specm(IR{s}) \ {(s* + 1)}. Let 


P(s) = Ee (<2 : 12 | . (41) 


Notice that P(s) has no zeros and no poles in MM M! = @. We will see that it is not possible 
to find invertible matrices U;(s) € Ry (s)?** N Rpr(s)?** and Up(s) € Ryy(s)?*? such that 


U; (s) P(s)Up(s) = Diag((p(s)/4(s))™, (p(s)/q(s))*). (42) 


We can write me u(s)(s* +1)" with u(s) a unit in Ryy(s) anda € Z. Therefore, 
q(s) 


Diag((p(s)/q(s))", (p(s)/q(s))) = Diag((s? + 1)", (s? + 1)**) Diag(u(s)", u(s)*). (43) 


Diag(u(s)°', u(s)) is invertible in IRyj(s)?*? and P(s) is also left Wiener-Hopf equivalent 
with respect to (M, M’) to the diagonal matrix Diag((s* + 1), (s* + 1)*). 


Assume that there exist invertible matrices U;(s) € Ry(s)?%* M Rpr(s)?** and Up(s) € 
Ryu(s)?%* such that U;(s)P(s)Uz(s) = Diag((s? + 1)", (s* + 1)%), with d, > dz integers. 
Notice first that det U;(s) is a nonzero constant and since det P(s) = s(s* + 1) and det Up(s) 
is a rational function with numerator and denominator relatively prime with s? +1, it follows 
that cs(s* + 1)? det Up(s) = (s? +1)"+. Thus, dy + dz = 2. Let 


Rete Eee fe > me bas | (44) 


bo1(s) ba2(s) Uzi ($) U22(s) 
From P(s)U>(s) = U;(s)~! Diag((s? + 1)#", (s* + 1)) we get 
suy1(s) = byy(s)(s* +1)", (45) 


— 8741 (s) + (s* + 1)2u21(s) = boi(s)(s? +1), (46) 
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su (s) = by2(s)(s* + 1)®, (47) 

—s*uy2(s) + (s* + 1)?u29(s) = bao(s)(s* +1)”. (48) 

As u11(s) € Ru(s) and by1(s) € Ryr(s) M Rpr(s), we can write 141 (s) = At and by; (s) = 
he with f1(s),91(s),/1(s) € Rs], gcd(g1(s),s? +1) = 1 and d(hy(s)) < 2q1. Therefore, 
by (45), si = ee (s? +.1)%. Hence, u44(s) = fi(s) or uy1(s) = a In the same 


way and using (47), uy2(s) = fo(s) or u42(s) = hls) with f(s) a polynomial. Moreover, by 
(47), dz must be non-negative. Hence, d; > dz > 0. Using now (46) and (48) and bearing 
in mind again that u1(s), u22(s) € Ry(s) and bz; (s),bz2(s) € Ryyr(s) A Rpr(s), we conclude 


that u21(s) and u9(s) are polynomials. 


We can distinguish two cases: dj = 2, dy = O and dj = dy = 1. If dy = 2 and dz = 0, by (47), 
by9(s) is a polynomial and since bj)(s) is proper, it is constant: bj) (s) = cy. Thus uy9(s) = 2. 
By (48), boo (s) = —cys + (s? + 1)?u129(s). Since u2(s) is polynomial and b»9(s) is proper, b29(s) 
is also constant and then u29(s) = 0 and cy = 0. Consequentially, bo2(s) = 0, and by2(s) = 0. 
This is impossible because Uj (s) is invertible. 


If dj = dz = 1 then, using (46), 


bo (s) = PMH +4Fam(s) _ PRE eH) +H) 
= —shyy(s) + (s? + 1)uni(s) = —s athe + (s? + 1)uai(s) (49) 


— shy (s)+(s?+1)11* up (s) 
(1) 


Notice that d(—sh,(s)) < 1+2q; and d((s* +1)™+1up1(s)) = 2(q, +1) + d(ug1(s)) > 2g) +2 
unless u1(s) = 0. Hence, if uz1(s) 4 0, d(—shy(s) + (s* +. 1)™+1u91(s)) > 2q, + 2 which is 
greater than d((s* + 1)7!) = 2q1. This cannot happen because b>; (s) is proper. Thus, u21(s) = 
0. In the same way and reasoning with (48) we get that u9(s) is also zero. This is again 
impossible because Up(s) is invertible. Therefore no left Wiener—Hopf factorization of P(s) 
with respect to (M, M’) exits. 


We end this section with an example where the left Wiener—Hopf factorization indices of the 
matrix polynomial in the previous example are computed. Then an ideal generated by a 
polynomial of degree 1 is added to M and the Wiener—Hopf factorization indices of the same 
matrix are obtained in two different cases. 


Example 23. Let F = Rand M = {(s* + 1)}. Consider the matrix 
s 0 
P(s) = E 2 a: (50) 
which has a zero at 0. It can be written as P(s) = P,(s)P2(s) with 


KO=| pip) BOA(51 | 61) 


where the global invariant factors of P,(s) are powers of s* +1 and the global invariant 
factors of P(s) are relatively prime with s? + 1. Moreover, the left Wiener-Hopf factorization 
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indices of P;(s) at infinity are 3, 1 (add the first column multiplied by s* + 2s to the second 
column; the result is a column proper matrix with column degrees 1 and 3). Therefore, the left 
Wiener—Hopf factorization indices of P(s) with respect to M are 3, 1. 


Consider now M = {(s* + 1),(s)} and M’ = Specm(IR[s]) \ M. There is a unimodular matrix 
1s*+2]. ey 2x2 s s+2s]. 
Uu(s) = 01 | invertible in Ry, (s)***, such that P(s)U(s) = a is column 
proper with column degrees 3 and 2. We can write 
P(syu(s) =| ° 2 sx? 0 4/825] ~ B(s) s? 0 (52) 
~ [-10] [08° O1| Os? 
where B(s) is the following biproper matrix 
a f-0.4 goal ieo Oo]. | 22 
w= ([9.3)+ [62] fe l- [42], ei 
Moreover, the denominators of its entries are powers of s and det B(s) = ia as Therefore, 


B(s) is invertible in R yp (s)?%* M Rpr(s)?*?. Since B(s)~'P(s)U(s) = Diag(s?,s°), the left 
Wiener—Hopf factorization indices of P(s) with respect to M are 3, 2. 


If M = {(s* +1), (s —1)}, for example, a similar procedure shows that P(s) has 3,1 as left 
Wiener—Hopf factorization indices with respect to M; the same indices as with respect to M. 
The reason is that s — 1 is not a divisor of det P(s) and so P(s) = P,(s)P2(s) with P,(s) and 
P)(s) as in (51) and P,(s) factorizing in M. 


Remark 24. It must be noticed that a procedure has been given to compute, at least 
theoretically, the left Wiener—-Hopf factorization indices of any rational matrix with respect 
to any subset M of Specm(F[s]). In fact, given a rational matrix T(s) and M, write T(s) = 
Ti(s)Tr(s) with the global invariant rational functions of Tz(s) factorizing in M, and the 
global invariant rational functions of Tr(s) factorizing in Specm(F{s]) \ M (for example, 
using the global Smith-McMillan form of T(s)). We need to compute the left Wiener-Hopf 
factorization indices at infinity of the rational matrix T,(s). The idea is as follows: Let d(s) be 
the monic least common denominator of all the elements of Ty, (s). The matrix T,(s) can be 
written as Ty (s) = Mee with P(s) polynomial. The left Wiener—Hopf factorization indices 
of P(s) at infinity are the column degrees of any column proper matrix right equivalent 
to P(s). If ky,...,km are the left Wiener—Hopf factorization indices at infinity of P(s) then 
ky +d,...,kn +d are the left Wiener-Hopf factorization indices of T,(s), where d = d(d(s)) 
(see [1]). Free and commercial software exists that compute such column degrees. 


5. Rosenbrock’s Theorem via local rings 


As said in the Introduction, Rosenbrock’s Theorem ([14]) on pole assignment by state feedback 
provides, in its polynomial formulation, a complete characterization of the relationship 
between the invariant factors and the left Wiener-Hopf factorization indices at infinity of any 
non-singular matrix polynomial. The precise statement of this result is the following theorem: 
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Theorem 25. Let 9) > --- > gm and o,(s) | --- | &m(s) be non-negative integers and 
monic polynomials, respectively. Then there exists a non-singular matrix P(s) € F[s|™*™ with 
«1(S8),...,Qm(s) as invariant factors and g1,...,%m as left Wiener—Hopf factorization indices at 
infinity if and only if the following relation holds: 


(81,-+-/8m) < (d(am(s)),---,d(a4 (s)))- (54) 


Symbol ~ appearing in (54) is the majorization symbol (see [11]) and it is defined as follows: If 
(a1,...,@m) and (b1,...,bim) are two finite sequences of real numbers and ayy) >. > Amn) and 
by) 2 +++ & dj are the given sequences arranged in non-increasing order then (ay,...,4m) ~ 
(b1,..-, Dm) if 


i 
aS Leb, 1sjs<m-1 (55) 
with equality for j = m. 


The above Theorem 25 can be extended to cover rational matrix functions. Any rational matrix 
(s) 


T(s) can be written as 1G; where d(s) is the monic least common denominator of all the 


elements of T(s) and N(s) is polynomial. It turns out that the invariant rational functions of 
T(s) are the invariant factors of N(s) divided by d(s) after canceling common factors. We also 
have the following characterization of the left Wiener- Hopf factorization indices at infinity of 
T(s): these are those of N(s) plus the degree of d(s) (see [1]). Bearing all this in mind one can 
easily prove (see [1]) 


Ar ee aaa irreducible rational functions, 


where «;(s),Bi(s) € F[s] are monic such that «1(s) | --- | &m(s) while Bin(s) |---| Bi(s). Then 
there exists a non-singular rational matrix T(s) € F(s)"*™ with 91,...,%m as left Wiener-Hopf 
a1(s) Xm(s) 
Bi(s)’*""” Bu(s) 


(S1,--+,8m) < (d(am(s)) — 4(Bm(s)),---,d(a1(s)) — 4(Br(s)))- (56) 


Theorem 26. Let g; > --- > &m be integers and 


factorization indices at infinity and 


if 


as global invariant rational functions if and only 


Recall that for M C Specm(F[s]) any rational matrix T(s) can be factorized into two matrices 
(see Proposition 2) such that the global invariant rational functions and the left Wiener-Hopf 
factorization indices at infinity of the left factor of T(s) give the invariant rational functions 
and the left Wiener—Hopf factorization indices of T(s) with respect to M. Using Theorem 26 
on the left factor of T(s) we get: 


Theorem 27. Let M C Specm(F[s]). Let kj > --- > km be integers and ue, ehh ae be 
irreducible rational functions such that €;(s) |---| €m(s), Pm(s) | «++ | ~1(s) are monic polynomials 
factorizing in M. Then there exists a non-singular matrix T(s) € F(s)"™*™ with an, wag cut} as 
invariant rational functions with respect to M and ky,...,km as left Wiener—Hopf factorization indices 


with respect to M if and only if 
(k1,-++kim) < (d(€m(s)) — 4(Hm(s)),---,d(er(s)) — 4(1(s))). (57) 


aoe 
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Theorem 27 relates the left Wiener-Hopf factorization indices with respect to M and the finite 
structure inside M. Our last result will relate the left Wiener—-Hopf factorization indices with 
respect to M and the structure outside M, including that at infinity. The next Theorem is an 
extension of Rosenbrock’s Theorem to the point at infinity, which was proved in [1]: 


Theorem 28. Let 9] > --- > &m and q, > --+ > qm be integers. Then there exists a non-singular 
matrix T(s) € F(s)"*™ with 91,...,8m as left Wiener—Hopf factorization indices at infinity and 
sf1,...,8%™ as invariant rational functions at infinity if and only if 


(S1,+++/8m) < (91s+++/ 4m): (58) 


Notice that Theorem 26 can be obtained from Theorem 27 when M = Specm(F{s]). In 
the same way, taking into account that the equivalence at infinity is a particular case of the 
equivalence in Fyy(s) M Fp;(s) when M’ = @, we can give a more general result than that of 
Theorem 28. Specifically, necessary and sufficient conditions can be provided for the existence 
of a non-singular rational matrix with prescribed left Wiener—-Hopf factorization indices with 
respect to M and invariant rational functions in F yy (s) 9 Fpr(s). 


Theorem 29. Let M,M’ C Specm(FF[s]) be such that MUM’ = Specm(F[s]). Assume that 
there are ideals in M\ M!' generated by linear polynomials and let (s — a) be any of them. Let 


_ j ex(s) €m(s) 
k, > > km be integers, Bis)’ ” Pals) 


Wm(s)|---|p1(s) are monic polynomials factorizing in M’ \ M and 1,,...,ln integers such that 
: + d(1(s)) — d(e1(s)) < +++ < Im +d(Wm(s)) — d(€m(s)). Then there exists a non-singular 
matrix T(s) € F(s)"*"™ with no zeros and no poles in MN M! with ky,...,km as left Wiener-Hopf 
€1(s) 1 Em(S) 1 
YW (s) (s—a)"t nee 4 Pn (Ss) (s—a)/m 
functions in Fy (s) OM Fpr(s) if and only if the following condition holds: 


(ky,...,km) ~ (-h,...,-ln). (59) 


irreducible rational functions such that €,(s)|---|€m(s), 


factorization indices with respect to M and as invariant rational 


The proof of this theorem will be given along the following two subsections. We will use 
several auxiliary results that will be stated and proved when needed. 


5.1. Necessity 


We can give the following result for rational matrices using a similar result given in Lemma 
4.2 in [2] for matrix polynomials. 


Lemma 30. Let M, M’ C Specm(F[s]) be such that MU M' = Specm(F{s]). Let T(s) € F(s)™*™ 
be a non-singular matrix with no zeros and no poles in MN M!' with gy > --- > &m as left 
Wiener—Hopf factorization indices at infinity and ky > --- > km as left Wiener—Hopf factorization 
€1( €m(s) 

(s)’"""’ Pm(s) 


are the invariant rational functions of T(s) with respect 
to M’ then 


(81 — ky, +-/8m — Km) < (d(Em(s)) — d(Wm(s)),---,d(er(s)) — d(pr(s)))- (60) 


It must be pointed out that (¢1 — k1,...,8m — km) may be an unordered m-tuple. 
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Proof.- By Proposition 2 there exist unimodular matrices U(s), V(s) € F[s]"*™ such that 
‘ 1(s) am (Ss) ia €1(s) Em(S) . 
Ts) = UUs) Ding (BS) ay) PMB CGreay gma) VE) 
with aj(s) | a+1(s), Bi(s) | Bi-1(s), €i(s) | €i+1(s), Pils) | Pi-1(s), ai(s), Bi(s) units in 


Fy\m(s) and e;(s), #;(s) factorizing in M’ \ M because Ms ) has no poles and no zeros in MM 
M’. Therefore T(s) = T,(s)Tr(s), where T,(s) = U(s) Diag (aa.. vay a) has ky,..., kin 
as left Wiener-Hopf factorization indices at infinity and Tr(s) = Diag (28, vo, Emly) ) V(s) 


(s)’ ” Pn(s) 

has ae, nes ie = as global invariant rational functions. Let d(s) = 6,(s)(s). Hence, 
d(s)T(s) = U(s) Diag(a,(s),...,®m(s)) Diag(€1(s),...,€m(s))V(s) (62) 
with a;(s) = a 1(s) units in Fy y(s) and &;(s) (s) factorizing in M’ \ 


M. Put P(s) = d(s)T(s). Its left Wiener—Hopf factorization indices at infinity are 9, + 
d(d(s)),...,2m +d(d(s)) [1, Lemma 2.3]. The matrix P,(s) = U(s) Diag(@(s),...,@m(s)) = 
B1(s)Tr(s) has ky + d(By1(s)),...,km + d(B1(s)) as left Wiener-Hopf factorization indices at 
infinity. Now if P2(s) = Diag(&( ),.+-,€m(s))V(s) = 1(s)Tr(s) then its invariant factors 
are €1(s),...,€m(s), P(s) = Py(s)P2(s) and, by [2, Lemma 4.2], 


(g1 + d(d(s)) — ky — d(Bi(s)),---,8m + 4(d(s)) — km — 4(Bi(s))) < (4(Em(s)),-- as 
Therefore, (60) follows. | 


5.1.1. Proof of Theorem 29: Necessity 
If €1(s) L Em(S) 


1 
1 (s) (s—a)1’°**’ Pus) (s—a)in 
then there exist matrices Uj (s), Uz(s) invertible in Fay (s)"*" M Fp; (s)”"*™ such that 


are the invariant rational functions of T(s) in Fyy(s) M Fpr(s) 


a= s) Dia €1(s) 1 Em/(S) ik S 
T(s) = U;(s)D &( 25 yee crak) Up(s). (64) 
h 


We analyze first the finite structure of T(s) with respect to M’. If D,(s) = Diag((s — a)~", 
.,(s—a)~'") © Fyyp(s)™*", we can write T(s) as follows: 


€1(s) Em(S) 
Yils)"" Pn(s) 


with U;(s) and D;(s)U2(s) invertible matrices in Fyy(s)"*™. Thus s, hing 2 are the 


invariant rational functions of T(s) with respect to M’. Let g; > --- > gm be the left 
Wiener—Hopf factorization indices of T(s) at infinity. By Lemma 30 we have 


(81 — kt, +-/8m — Km) < (d(Em(s)) — d(Wm(s)),---,d(er(s)) — d(pr(s)))- (66) 


T(s) = Un(s) Diag ( ) Pr(s)uat), (65) 


As far as the structure of T(s) at infinity is concerned, let 


—Leg(s) sh tdCdi(s))=d1er(8)) egg (s) ln tl n(6))—aCEn() 
Dp(s) = Diag (ee: aa eae e=0F : (67) 
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Then D2(s) € Gl(Fpr(s)) and 
T(s) = Uy(s) Diag (s-#-HHO) +41), ge -AC Puls) +4(Em())) Do (s)Up(s) (68) 


where Uj(s) € Fp;(s)”*” and D2(s)Uz(s) € Fpr(s)”*” are biproper matrices. Therefore 
sh-a(tr(s))+4(er(s)) ||, gin —a(Pm(8))+4(€m(S)) are the invariant rational functions of T(s) at 
infinity. By Theorem 28 


(81--+/8m) < (—h — d(x (s)) + d(E1(8)),--+, ln — (m(s)) +4(Em(s)))- (69) 


Let 7 € Xi, (the symmetric group of order m) be a permutation such that g,(1) — ky) 2++- 2 
8o(m) — Kom) and define cj = g,() — kg(j), 1 = 1,...,m. Using (66) and (69) we obtain 


Ek + E(dle(s)) — ays) < EHt Lg 
j=l j=l j= j=m-r+1 ; 
<Li+Ug-h= Leg; (70) 
j=l =1 j=l 
< L-l+  dlej(s)) — 4(y,(s))) 
=I j=l 


forr =1,...,m—1. Whenr = m the previous inequalities are all equalities and condition (59) 
is satisfied. ] 


Remark 31. It has been seen in the above proof that if a matrix has 

e€1(s) 1 Em(S) 1 

oi(s) (s—a)" 2 "1 hn (s) (s—a)im 

0 cor i s are its invariant rational functions with respect to M’ and sd (s))+d(er(s)) | 
Im—d(pm(s))+4(Em(S)) are its invariant rational functions at infinity. 


as invariant rational functions in Fyy(s) M Fpr(s) then 


5.2. Sufficiency 
Let a,b € F be arbitrary elements such that ab 4 1. Consider the changes of indeterminate 
1 x 1 
f(s) =e 7 f(s) 


and notice that f(f(s)) = f(f(s)) = s. For a(s) € Ff{s], let F[s] \ (a(s)) denote the 
multiplicative subset of F[s] whose elements are coprime with «(s). For a,b € F as above 


(71) 


define 
F[s] — F{s] \ (s — b) 72) 
n(s) 4 (s— p)4(n(s)) 7 (a 4 ss) = (s— pans) n(f(s)) . 
In words, if 1(s) = pa(s — a)4 + pa_a(s—a)*1 +--+ + p1(s— a) + po (pa # 0) then 
ab(7(S)) = po(s — b)4 + pi(s —b)41 +--+ + pa_a(s — b) + pa. (73) 


In general a m(s))) < d(zt(s)) with equality if and only if 7(s) € F[s] \ (s—a). This 
shows that the restriction h,, : F[s] \ (s—a) — Ff{s] \ (s— b) of tz, to F[s] \ (s—a) isa 
bijection. In addition kh is the restriction of ty, to F[s] \ (s — b);ie., 


hi: F[s] \ (s — b) > F{s] \ (s — a) 


a,b 
a(s) H (s— a)a(a(s)) x (6 +4 5) = (s— a)i(e(s)) a(f(s)) (74) 
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or hes = Mba. 


In what follows we will think of a,b as given elements of F and the subindices of t, », hg» and 
ley will be removed. The following are properties of h (and h—1) that can be easily proved. 


Lemma 32. Let 711(s),712(s) € F[s] \ (s — a). The following properties hold: 
2(s)) = h(711(s) )h(712(s)). 


| 72(s) then h(7(s)) | h(702(s)). 
is an irreducible polynomial then h(71(s)) is an irreducible polynomial. 


1. h(7(s)70 
2. If 7(s) 
3. If 7(s) 
4. If 71(s), %™2(s) are coprime polynomials then h(7t;(s)), h(7t2(s)) are coprime polynomials. 


As a consequence the map 


H : Specm (F[s]) \ {(s — @)} — Speem (F{s]) \ {(s — b)} (75) 
(x(s)) as (g5h(7(s))) 
with po = 7(a), is a bijection whose inverse is 
H~' : Specm (IF[s]) \ {(s — b)} — Specm (F Bee) (76) 


(a(s)) > (at (a#(s))) 


where ay = «(b). In particular, if M’ C Specm(F[s}) \ {(s — a)} and M = Specm(F{s]) \ (M’U 
{(s —a)}) (i.e. the complementary subset of M’ in Specm (F[s]) \ {(s — a) }) then 


H(M) = Specm (F[s]) \ (H(M’) U {(s — b)}). (77) 
In what follows and for notational simplicity we will assume b = 0. 


Lemma 33. Let M’ C Specm (F[s]) \ {(s — a)} where a € F is an arbitrary element of F. 


1. If (s) € F[s] factorizes in M' then h(71(s)) factorizes in H(M’). 
2. If m(s) € F[s] is a unit of Fy(s) then t(7(s)) is a unit of Fry) (s). 


Proof.- 1. Let 77(s) = c7t1(s)8" - - - 7m(s)8™ with c 4 0 constant, (71;(s)) € M’ and g; > 1. Then 
h(7(s)) = c(h(7y(s)))& +: - (A(7tm(s)))8". By Lemma 32 h(77;(s)) is an irreducible polynomial 
(that may not be monic). If c; is the leading coefficient of h(7;(s)) then zh(7;(s)) is monic, 
irreducible and (Zh(77;(s))) € H(M’). Hence h(zt(s)) factorizes in H(M’). 

2. If m(s) € F[s] is a unit of Fyy(s) then it can be written as 7t(s) = (s — a)&711(s) where 
g = Oand 7; (s) is a unit of F y44(s_q)} (S$). Therefore 7 (s) factorizes in Specm(F[s]) \ (M’ U 
{(s —a)}). Since t(7(s)) = h(7t,(s)), it factorizes in (recall that we are assuming b = 0) 
H(Specm(F[s]) \ (M'U {(s — a)}) = Specm(F[s]) \ (H(M") U {(s)}). So, t(7t(s)) is a unit of 
Fu) (s). |_| 


Lemma 34. Let a € F be an arbitrary element. Then 


1. If M! C Specm (F[s]) \ {(s — 4) } and U(s) € Glnn(Fyr(s)) then U(f(s)) € Gl (Frwy (s)).- 
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2. IfU(s) € Gln(Fs—a(s)) then U(f(s)) € Gln (Fpr(s)). 
3. If U(s) € Glin(Fpr(s)) then U(f(s)) € Gln (Fs(s)). 
4. If (s—a) € M' C Specm(EF[s]) and U(s) € Gln(Fy(s)) then the matrix U(f(s)) € 


Glin (F u(r {(s—a) a) (3 8)) A Glin (F pr(s)) 


Proof.- Let re with p(s),q(s) € F{s]. 


p(f(s)) _ s#)) p(F(s)) a(q(s))-a(p(s)) _ t(p(s)) d(q(s))—A(p(s)). (78) 


1. Assume that U(s) € Gln(Fy(s)) and let re ue be any element of U(s). Therefore q(s) is a 


unit of Fy (s) and, by Lemma 33.2, t(q(s)) is a unit of Fy4(,47)(s). Moreover, s is also a unit of 


FHm)(s)- Hence, Aes € Fy)(s). Furthermore, if det U(s) = pe, it is a unit of Fay (s) 
and det U(f(s)) = mae is a unit of Fyy(,47)(s). 


2. If Be is any element of U(s) € Glmn(Fs—a(s)) then q(s) € Ff{s] \ (s — a) and so 
d(h(q(s))) = d(q(s)). Since s — a may divide p(s) we have that d(t(p(s))) < d(p(s)). Hence, 
d(h(q(s))) — d(q(s)) = d(t(p(s)) — d(p(s)) and BER = AER) sAla(s))—AP) € Bpr(s) 


Moreover if detU(s) = i then p(s),G(s) € F[s] \ (s— a), d(h(p(s))) = d(p(s)) and 
p 


d(h(q(s))) = d(q(s)). Thus, det U(f(s)) = J2Ss 
ie., a unit of Fpr(s). 
( 


4(q(s))—4(P(s)) ig a biproper rational function, 


3. If U(s) € Gln(Fpr(s)) and ne is any element of U(s) then d(q(s)) > d(p(s)). Since 


PUf(s)) _ pls) sd(q(s))—d(P(s)) and t(p(s)),t(q(s)) € Els] \ (s) we obtain that U(f(s)) € 


F,(s)"*™. In addition, if detU(s) = me i, which is a unit of F,,(s), then d(q(s)) = d(p(s)) 


and since t(p(s)),t(q(s)) € F[s] \ (s) we conclude that det U(f(s)) = feet is a unit of F.(s). 


4. It is a consequence of 1., 2. and Remark 1.2. | 


Proposition 35. Let M C Specm(EF|s]) and (s — a) € M. If T(s) € F(s)"*"” is non-singular with 


He = (s—a)8i a (e;(s), p;(s) € F[s] \ (s — a)) as invariant rational functions with respect to M 


then T(f(s))? € F(s)"*™ is a non-singular matrix with + 1 ey s—8itd(pi(s))—4(€i(S)) as invariant 


ei (a) 


pia)’ 


mxm 


rational functions in F 54/4 {(s—a)})(8)""*"™" O Fpr(s) where c; = 


Proof.- Since (s — a)8i a are the invariant rational functions of T(s) with respect to M, there 
are U;(s),U2(s) € Glin (Fyy(s)) such that 


T(s) = Uy(s) Diag (“ ayn fuls) (sg a)sin a | Up(s). (79) 


$1 (s) Ym (8) 
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Notice that (f(s) — a)% ci(f(s)) _ hlei(s)) --gi+4(pi(s))—-d(€()), Let c; = £42, which is a 


pilf(s)) h(yi(s)) pila)’ 
non-zero constant, and put D = Diag (c1,...,Cm). Hence, 
T(f(s))? = Up(f(s))"DL(s)Uy (f(s))* (80) 
with 
; 1 h(ei(s)) _-¢, +a = 1 h(€m(s)) _ 7 
L —D 81 pi(s)) d(er(s)) 32 &m+d(Pm(s))—d(Em(s)) 
(5) oe (~ h(yy (s))° Cm h(Pm(s)) 
(81) 
By 4 of Lemma 34 matrices U;(f(s))", Us(f(s))? € Glin (Fi(m\{(s—a)}) ($)) A Glm (Fpr(s)) and 
the Proposition follows. | 


Proposition 36. Let M,M’ C Specm(F[s]) such that MUM! = Specm(IF[s]). Assume that 
there are ideals in M \ M' generated by linear polynomials and let (s — a) be any of them. If T(s) € 
F(s)"*" is a non-singular rational matrix with no poles and no zeros in MM M’ and ky,...,km as 
left Wiener-Hopf factorization indices with respect to M then T(f(s))' € F(s)"*"™ is a non-singular 
rational matrix with no poles and no zeros in H(MM M’) and —km,...,—ky as left Wiener-Hopf 
factorization indices with respect to H(M’) U {(s)}. 


Proof.- By Theorem 19 there are matrices U;(s) invertible in Fy (s)”*" 1 Fp;(s)”*™ and 
Un(s) invertible in Fyy(s)*™ such that T(s) = U;(s) Diag ((s = a)™ peony (8 — ayn) Up(s). 
By Lemma 34 Up(f(s))? is invertible in Fr(m\{(s—a)}) (8) O Fpr(s)™*™" and Uy (f(s))7 is 
invertible in Fy (s)”*" OFs(s)"*™" = Frcyut(s)} (8)""*"™-. Moreover, H(M \ {(s—a)})U 
H(M’)U {(s)} = Specm(F[s]) and H(M \ {(s — a)}) NM (H(M’) U {(s)}) = H(MNM’). Thus, 
T(f(s))? = Un(f(s))? Diag Ge f25 gin) U;(f(s))? has no poles and no zeros in H(MNM 
M') and —ky,...,—k, are its left Wiener-Hopf factorization indices with respect to H(M’) U 


{(s)}- 7 
5.2.1. Proof of Theorem 29: Sufficiency 

Let kj > --- > km be integers, ae alee 7 i. irreducible rational functions such that €1(s) | 
++ | €m(s), Pm(s) | +--+ | Y1(s) are monic polynomials factorizing in M’ \ M and l,...,Ln 


integers such that 1; + d(1(s)) — d(e1(s)) < +--+ < Im +d(m(s)) — d(€m(s)) and satisfying 
(59). 

Since €;(s) and w;(s) are coprime polynomials that factorize in M’ \ M and (s—a) € 

;: h(ex(s)) <1, +d(y1(s))—d(ex(s)) N(Em(S)) gln-+d (Wm (s))—d(€Em(s)) 

M \ M’, by Lemmas 32 and 33, i@i(s))> 1 1 EGGS are 
irreducible rational functions with numerators and denominators polynomials factorizing in 
H(M')U {(s)} (actually, in H(M’ \ M) U {(s)}) and such that each numerator divides the next 
one and each denominator divides the previous one. 


By (59) and Theorem 27 there is a matrix G(s) ©€ [F(s)"*" with —ky,...,—k, 
as left Wiener-Hopf factorization indices with respect to H(M’) U {(s)} and 
1 Nex(s)) gh +d(r(s))—d(e1(s)) A h(Em(S)) tna (Wm (s))—d (em (s)) 
ci h(yn(s)) "4 Cu A(Ym(s) 


with respect to H(M’) U {(s)} where c; = tay’ 


as invariant rational functions 


i = 1,...,m. Notice that G(s) has no zeros 
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and poles in H(MN M’) because the numerator and denominator of each rational function 
Fegaeysh tas) Ale) factorizes in H(M’ \ M) U {(s)} and so it is a unit of Fymnm)(s): 


i 


Put M = H(M’)U{(s)} and M’ = H(M\ {(s—a)}). As remarked in the proof of Proposition 
36, MU M!' = Specm(EF[s]) and Mm M' = H(MM M’). Now (s) € M so that we can apply 


Proposition 35 to G(s) with the change of indeterminate f(s) = =+,. Thus the invariant 
rational functions of G(f(s))? in Fy (s) M Fpr(s) are oe ea ee eat an ; 


On the other hand M’ = H(M \ {(s—a)}) C Specm(F{s]) \ {(s)} and so (s) © M\ M’. 
Then we can apply Proposition 36 to G(s) with f(s) = <4, so that G(f(s))? is a non-singular 
matrix with no poles and no zeros in H~!(MN M’) = H-!(H(MNM')) = MOM! and 
ky,..-,km as left Wiener—-Hopf factorization indices with respect to H~! (M’) U {(s—a)} = 
(M \ {(s—a)}) U {(s — a)} = M. The theorem follows by letting T(s) = G(f(s))?. a 


Remark 37. Notice that when M’ = © and M = Specm(F[s]) in Theorem 29 we obtain 
Theorem 28 (q; = —1)). 
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1. Introduction 


Quantum field theory is the most universal method in physics, applied to all the area from 
condensed-matter physics to high-energy physics. The standard tool to deal with quantum 
field theory is the perturbation method, which is quite useful if we know the vacuum 
of the system, namely the starting point of our analysis. On the other hand, sometimes 
the vacuum itself is not obvious due to the quantum nature of the system. In that case, 
since the perturbative method is not available any longer, we have to treat the theory in a 
non-perturbative way. 


Supersymmetric gauge theory plays an important role in study on the non-perturbative 
aspects of quantum field theory. The milestone paper by Seiberg and Witten proposed a 
solution to N = 2 supersymmetric gauge theory [48, 49], which completely describes the low 
energy effective behavior of the theory. Their solution can be written down by an auxiliary 
complex curve, called Seiberg-Witten curve, but its meaning was not yet clear and the origin 
was still mysterious. Since the establishment of Seiberg-Witten theory, tremendous number of 
works are devoted to understand the Seiberg-Witten’s solution, not only by physicists but also 
mathematicians. In this sense the solution was not a solution at that time, but just a starting 
point of the exploration. 


One of the most remarkable progress in N = 2 theories referring to Seiberg-Witten theory 
is then the exact derivation of the gauge theory partition function by performing the integral 
over the instanton moduli space [43]. The partition function is written down by multiple 
partitions, thus we can discuss it in a combinatorial way. It was mathematically proved 
that the partition function correctly reproduces the Seiberg-Witten solution. This means 
Seiberg-Witten theory was mathematically established at that time. 


The recent progress on the four dimensional N= 2 supersymmetric gauge theory has 
revealed a remarkable relation to the two dimensional conformal field theory [1]. This relation 
provides the explicit interpretation for the partition function of the four dimensional gauge 
theory as the conformal block of the two dimensional Liouville field theory. It is naturally 
regarded as a consequence of the M-brane compactifications [23, 60], and also reproduces 
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the results of Seiberg-Witten theory. It shows how Seiberg-Witten curve characterizes the 
corresponding four dimensional gauge theory, and thus we can obtain a novel viewpoint of 
Seiberg-Witten theory. 


Based on the connection between the two and four dimensional theories, established results 
on the two dimensional side can be reconsidered from the viewpoint of the four dimensional 
theory, and vice versa. One of the useful applications is the matrix model description of the 
supersymmetric gauge theory [12, 16, 17, 47]. This is based on the fact that the conformal block 
on the sphere can be also regarded as the matrix integral, which is called Dotsenko-Fateev 
integral representation [14, 15]. In this direction some extensions of the matrix model 
description are performed by starting with the two dimensional conformal field theory. 


Another type of the matrix model is also investigated so far [27, 28, 30, 52, 53]. This is 
apparently different from the Dotsenko-Fateev type matrix models, but both of them correctly 
reproduce the results of the four dimensional gauge theory, e.g. Seiberg-Witten curve. While 
these studies mainly focus on rederiving the gauge theory results, the present author reveals 
the new kind of Seiberg-Witten curve by studying the corresponding new matrix model 
[27, 28]. Such a matrix models is directly derived from the combinatorial representation 
of the partition function by considering its asymptotic behavior. This treatment is quite 
analogous to the matrix integral representation of the combinatorial object, for example, the 
longest increasing subsequences in random permutations [3], the non-equilibrium stochastic 
model, so-called TASEP [26], and so on (see also [46]). Their remarkable connection to the 
Tracy-Widom distribution [56] can be understood from the viewpoint of the random matrix 
theory through the Robinson-Schensted-Knuth (RSK) correspondence (see e.g. [51]). 


In this article we review such a universal relation between combinatorics and the matrix 
model, and discuss its relation to the gauge theory. The gauge theory consequence can be 
naturally extacted from such a matrix model description. Actually the spectral curve of the 
matrix model can be interpreted as Seiberg-Witten curve for N =2 supersymmetric gauge 
theory. This identification suggests some aspects of the gauge theory are also described by the 
significant universality of the matrix model. 


This article is organized as follows. In section 2 we introduce statistical models defined in a 
combinaorial manner. These models are based on the Plancherel measure on a combinatorial 
object, and its origin from the gauge theory perspective is also discussed. In section 3 it is 
shown that the matrix model is derived from the combinatorial model by considering its 
asymptotic limit. There are various matrix integral representations, corresponding to some 
deformations of the combinatorial model. In section 4 we investigate the large matrix size 
limit of the matrix model. It is pointed out that the algebraic curve is quite useful to study 
one-point function. Its relation to Seiberg-Witten theory is also discussed. Section 5 is devoted 
to conclusion. 


2. Combinatorial partition function 


In this section we introduce several kinds of combinatorial models. Their partition functions 
are defined as summation over partitions with a certain weight function, which is called 
Plancherel measure. It is also shown that such a combinatorial partition function is obtained 
by performing the path integral for supersymmetric gauge theories. 
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Figure 1. Graphical representation of a partition A = (5,4,3,1,1) and its transposed partition 
A= (5,3,2,2,1) by the associated Young diagrams. There are 5 non-zero entries in both of them, 
(A) = Ay = 5 and £(A) = Ay =5. 


2.1. Random partition model 


Let us first recall a partition of a positive integer n: it is a way of writing n as a sum of positive 
integers 


A= (Az, A2,° an Agay) (1) 
satisfying the following conditions, 
e(A) 
n=) ASIA Ay 2 A2 2 -+ Aggy > 0 (2) 


Here £(A) is the number of non-zero entries in A. Now it is convenient to define A; = 0 for 
i > &(A). Fig. 2 shows Young diagram, which graphically describes a partition A = (5,4,2,1,1) 
with (A) =5. 


It is known that the partition is quite usefull for representation theory. We can obtain an 
irreducible representation of symmetric group G,, which is in one-to-one correspondence 
with a partition A with |A| = n. For such a finite group, one can define a natural measure, 
which is called Plancherel measure, 


dim A)? 
pn(a) = mA @) 
This measure is normalized as 
x Un(A) =1 (4) 
A s.t. |Al|=n 


It is also interpreted as Fourier transform of Haar measure on the group. This measure has 
another useful representation, which is described in a combinatorial way, 


Hn(A) =n! Il ; (5) 
(i,j) jer ae 
This h(i, j) is called hook length, which is defined with arm length and leg length, 
AG) =a) HIG )+L 
a(i,j) =Ai-j, (6) 


Gj) =Ay- 


Here A stands for the transposed partition. Thus the height of a partition A can be explicitly 
written as ((A) = Aq. 


va 
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J 

Figure 2. Combinatorics of Young diagram. Definitions of hook, arm and leg lengths are shown in (6). 
For the shaded box in this figure, a(2,3) = 4, 1(2,3) = 3, and h(2,3) = 8. 

With this combinatorial measure, we now introduce the following partition function, 


Gy Tg ”) 
Zuay =L {= — 
U(1) E hi ek h(i, j)? 


This model is often called random partition model. Here A is regarded as a parameter like a 
chemical potential, or a fugacity, and h stands for the size of boxes. 


Note that a deformed model, which includes higher Casimir potentials, is also investigated in 


detail [19], 
1 


Zrigher = IT gape Te 8 (8) 
X (ijyea OI) pa 

In this case the chemical potential term is absorbed by the linear potential term. There is an 

interesting interpretation of this deformation in terms of topological string, gauge theory and 

so on [18, 38]. 


In order to compute the U(1) partition function it is useful to rewrite it in a “canonical form” 
instead of the “grand canonical form” which is originally shown in (7), 


A 2n 1 
Zu(1) — ‘y =) i h(i, j)2 (9) 


n=0A s.t. |A|=n ( h ij)EA 


Due to the normalization condition (4), this partition function can be computed as 


A 2 


Although this is explicitly solvable, its universal property and explicit connections to 
other models are not yet obvious. We will show, in section 3 and section 4, the matrix 
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model description plays an important role in discussing such an interesting aspect of the 
combinatorial model. 


Now let us remark one interesting observation, which is partially related to the 
following discussion. The combinatorial partition function (7) has another field theoretical 
representation using the free boson field [44]. We now consider the following coherent state, 


Ip) = exp (Fo) 0) (11) 


Here we introduce Heisenberg algebra, satisfying the commutation relation, [an,4m| = 
NOn+m,o, and the vacuum |0) annihilated by any positive modes, ay,|0) = 0 for n > 0. Then it 
is easy to show the norm of this state gives rise to the partition function, 


Zui) = (Pl) (12) 


Similar kinds of observation is also performed for generalized combinatorial models 
introduced in section 2.2 [22, 44, 55]. 


Let us then introduce some generalizations of the U(1) model. First is what we call B-deformed 
model including an arbitrary parameter f € R, 


2|A| 
(6) _ (7) 1 
251) = = —— (13) 
U(1) Lo i A, hg (i,j) (i,j) 

Here we involve the deformed hook lengths, 


hg(i,j) =a) + Bl j) +1, WP, j) =ali,j) + BIG, j) + B (14) 


This generalized model corresponds to Jack polynomial, which is a kind of symmetric 
polynomial obtained by introducing a free parameter to Schur polynomial [34]. This 
Jack polynomial is applied to several physical theories: quantum integrable model called 
Calogero-Sutherland model [10, 54], quantum Hall effect [4-6] and so on. 


Second is a further generalized model involving two free parameters, 


2|A| -1 
(qt) _ | (1—q)(1—q™*) 
Za) = = = a 15 
» ( fh nae (1 — gtd) HGH) (1 — g-aG ¢-!G/)-1) 7) 


This is just a q-analog of the previous combinatorial model. One can see this is reduced to 
the B-deformed model (13) in the limit of q — 1 with fixing t = gh. This generalization 
is also related to the symmetric polynomial, which is called Macdonald polynomial [34]. This 
symmetric polynomial is used to study Ruijsenaars-Schneider model [45], and the stochastic 
process based on this function has been recently proposed [8]. 


Next is Z,-generalization of the model, which is defined as 


bes Bis) ee oe 
orbifold,U(1) : h T-invca h(i, iy 
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Figure 3. T-invariant sector for U(1) theory with A = (8,5,5,4,2,2,2,1). Numbers in boxes stand for 


their hook lengths h(i,j) = Aj —j+A j ~1+1. Shaded boxes are invariant under the action of ! = Z3. 
Here the product is taken only for the I’-invariant sector as shown in Fig. 3, 


h(i,j) =a(i,j) +1 j) +1=0 (modr) (17) 


This restriction is considered in order to study the four dimensional supersymmetric gauge 
theory on orbifold R4/Z, =~ C2/Z, [11, 20, 27], thus we call this orbifold partition function. This 
also corresponds to a certain symmetric polynomial [57] (see also [32]), which is related to the 
Calogero-Sutherland model involving spin degrees of freedom. We can further generalize this 
model (16) to the B- or the q-deformed Z,-orbifold model, and the generic toric orbifold model 
[28]. 


Let us comment on a relation between the orbifold partition function and the q-deformed 
model. Taking the limit of q + 1, the latter is reduced to the U(1) model because the q-integer 
is just replaced by the usual integer in such a limit, 


alae 


[x]q = eo (18) 


This can be easily shown by I’Hopital’s rule and so on. On the other hand, parametrizing 
q — w,q with w;, = exp(27i/r) being the primitive r-th root of unity, we have 
1— (wrq)* q>1 { x (x =0, mod r) (19) 
1—(w,q)~} 1 (x £0, modr) 


Therefore the orbifold partition function (16) is derived from the q-deformed one (15) by 
taking this root of unity limit. This prescription is useful to study its asymptotic behavior. 


2.2. Gauge theory partition function 


The path integral in quantum field theory involves some kinds of divergence, which are due 
to infinite degrees of freedom in the theory. On the other hand, we can exactly perform the 
path integral for several highly supersymmetric theories. We now show that the gauge theory 
partition function can be described in a combinatorial way, and yields some extended versions 
of the model we have introduced in section 2.1. 
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The main part of the gauge theory path integral is just evaluation of the moduli space volume 
for a topological excitation, for example, a vortex in two dimensional theory and an instanton 
in four dimensional theory. Here we concentrate on the four dimentional case. See [13, 21, 
50] for the two dimensional vortex partition function. The most usuful method to deal with 
the instanton is ADHM construction [2]. According to this, the instanton moduli space for 
k-instanton in SU(n) gauge theory on R?+, is written as a kind of hyper-Kahler quotient, 


Mrx = {(B1, Ba, LJ)|er = 0, ue = 0} /U(k) (20) 

By» € Hom(C*,C*), I ¢ Hom(C",C*), J ¢ Hom(C*,C") (21) 
vr = [By, BY] + [Bo, Bt] + ut — sty, (22) 

uc = [B1, Bo] + IJ (23) 


The k x k matrix condition 1R = We = 0, and parameters (By, Bz, I, J) satisfying this condition 
are called ADHM equation and ADHM data. Note that they are identified under the following 
U(k) transformation, 


(By, Bo,1,J) ~ (gBig |, gBog gL Jg'),  g © U(k) (24) 


Thus all we have to do is to estimate the volume of this parameter space. However it is well 
known that there are some singularities in this moduli space, so that one has to regularize it in 
order to obtain a meaningful result. Its regularized volume had been derived by applying the 
localization formula to the moduli space integral [41], and it was then shown that the partition 
function correctly reproduces Seiberg-Witten theory [43]. 


We then consider the action of isometries on C? = R* for the ADHM data. If we assign 
(z1,22) + (e*1z1,e'z9) for the spatial coordinate of C?, and U(1)"~! rotation coming from 
the gauge symmetry SU(n), ADHM data transform as 


(Bi, BL J) —+ (T,B1, T:Bp, IT; 1, T1T2Tal) (25) 


where we define the torus actions as Tz = diag(e’™,--- ,e!") € U(1)""1, Ty = el € U(1)?. 
Note that these toric actions are based on the maximal (erie of the gauge theory symmetry, 
U(1)? x U(1)""! Cc SO(4) x SU(n). We have to consider the fixed point of these isometries 
up to gauge transformation g € U(k) to perform the localization formula. 


The localization formula in the instanton moduli space is based on the vector field ¢*, which 
is associated with € € U(1)? x U(1)""1. It generates the one-parameter flow e'¢ on the moduli 
space M, corresponding to the isometries. The vector field is represented by the element of 
the maximal torus of the gauge theory symmetry under the O-background deformation. The 
gauge theory action is invariant under the deformed BRST transformation, whose generator 
satisfies ¢* = {Q*, Q*}/2. Thus this generator can be interpreted as the equivariant derivative 
dz = d+ ig» where ig» stands for the contraction with the vector field ¢*. The localization 


formula i 1S given by 
a(Z) = —2n liad Sew i-a 26 


one ~ 
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where a(¢) is an equivariant form, which is related to the gauge theory action. «(G) is zero 
degree part and Ly, : Tx, M — Tx,M is the map generated by the vector field ¢* at the fixed 
points xo. These fixed points are defined as ¢*(x9) = 0 up to U(k) transformation of the 
instanton moduli space. 


Let us then study the fixed point in the moduli space. The fixed point condition for them are 
obtained from the infinitesimal version of (24) and (25) as 


(1-9) +€x)Buij =9, (Gi -a)TIn =0, (—Gi +4, +), =0 (27) 


where the element of U(k) gauge transformation is diagonalized as el? = diag(e? potty elk) € 
U(k) with e = e, + €. We can show that an eigenvalue of ¢ turns out to be 


ay + (j —1)e, + (iI) e2 (28) 
and the corresponding eigenvector is given by 
tee 
BY BS h (29) 


Since ¢ is a finite dimensional matrix, we can obtain k; independent vectors from (29) with 
ky +---+k, = k. This means that the solution of this condition can be characterized by 
n-tuple Young diagrams, or partitions A= (AQ),. - A”) [42]. Thus the characters of the 
vector spaces are yielding 


n n 
7 1 = 
Vey Se ee, WS, (30) 

1=1 (i,j)eAW 1=1 


and that of the tangent space at the fixed point under the isometries can be represented in 
terms of the n-tuple partition as 


x = V*V(1—T)(1— Th) +W*V + V*WT,To 
n 4) Hl (m)_ KO 541 _y(m) 
=y Xb (Tr es tae gee (31) 
Lm (i,j)EAO 


Here A is a conjugated partition. Therefore the instanton partition function is obtained by 
reading the weight function from the character [43, 44], 


Zsu(n) = Arn lZ, (32) 
A 
ul 1 1 
Z=II Il - - 
* Dam (ijJEAD Aat + €2(AM™) — 7 +1) — eA — 2) aig — €n(AM™ — +14 —i41) 
(33) 


This is regarded as a generalized model of (7) or (13). Furthermore by lifting it to the five 
dimensional theory on R* x S!, one can obtain a generalized version of the q-deformed 
partition function (15). Actually it is easy to see these SU(n) models are reduced to the U(1) 
models in the case of nm = 1. Note, if we take into account other matter contributions in 
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addition to the vector multiplet, this partition function involves the associated combinatorial 
factors. We can extract various properties of the gauge theory from these partition functions, 
especially its asymptotic behavior. 


3. Matrix model description 


In this section we discuss the matrix model description of the combinatorial partition function. 
The matrix integral representation can be treated in a standard manner, which is developed in 
the random matrix theory [40]. 


3.1. Matrix integral 


Let us consider the following N x N matrix integral, 
Zmatrix = [Px e- pT V(X) (34) 


Here X is an hermitian matrix, and DX is the associated matrix measure. This matrix can be 
diagonalized by a unitary transformation, gX¢~! = diag(x1,--- , xy) with g € U(N), and the 
integrand is invariant under this transformation, Tr V(X) = Tr V(gXg~!) = ON, V(x;). On 
the other hand, we have to take care of the matrix measure in (34): the non-trivial Jacobian is 
arising from the matrix diagonalization (see, e.g. [40]), 


DX = Dx DU A(x)? (35) 


The Jacobian part is called Vandermonde determinant, which is written as 


A(x) = [Ii _ x;) (36) 
i<j 
and DU is the Haar measure, which is invariant under unitary transformation, D(gU) = 
DU. The diagonal part is simply given by Dx = iene ,4x;. Therefore, by integrating out 
the off-diagonal part, the matrix integral (34) is reduced to the integral over the matrix 
eigenvalues, 


Zmatrix = [re A(x)? eh oN V(x) (37) 


This expression is up to a constant factor, associated with the volume of the unitary group, 
vol(U(N)), coming from the off-diagonal integral. 


When we consider a real symmetric or a quaternionic self-dual matrix, it can be diagonalized 
by orthogonal/symplectic transformation. In these cases, the Jacobian part is slightly 
modified, 


Z matrix = pe A(x)7P eck a V(xi) (38) 


The power of the Vandermonde determinant is given by 6 = i, 1,2 for symmetric, hermitian 
and self-dual, respcecively.! They correspond to orthogonal, unitary, symplectic ensembles in 
random matrix theory, and the model with a generic 6 € R is called B-ensemble matrix model. 


| This notation is different from the standard one: 26 + B = 1,2,4 for symmetric, hermitian and self-dual matrices. 
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Figure 4. Shape of Young diagram can be represented by introducing one-dimensional exclusive 
particles. Positions of particles would be interpreted as eigenvalues of the matrix. 


3.2. U(1) partition function 


We would like to show an essential connection between the combinatorial partition function 
and the matrix model. By considering the thermodynamical limit of the partition function, it 
can be represented as a matrix integral discussed above. 


Let us start with the most fundamental partition function (7). The main part of its partition 
function is the product all over the boxes in the partition A. After some calculations, we can 
show this combinatorial factor is rewritten as 


1 N N 1 
— =]|(ai-Ajtj-i : 39 
ma To ee tansy eu 


where N is an arbitrary integer satisfying N > £(A). This can be also represented in an infinite 
product form, 


ee ae 
Tl aax- (40) 
Ger MtD tg a 


These expressions correspond to an embedding of the finite dimensional symmetric group 
Gn into the infinite dimensional one Goo. 


By introducing a new set of variables ¢; = Aj + N —i +1, we have another representation of 
the partition function, 


Z A\ 2084 G-N(N41)_N oT 1 41 
u(t) -E(5) [iene [rep 7 


These new variables satisfy ¢; > ¢2 > +++ > €y(,) while the original ones satisfy Ay > Az = 

- > Aga). This means {¢;} and {A;} are interpreted as fermionic and bosonic degrees 
of freedom. Fig. 4 shows the correspondence between the bosinic and fermionic variables. 
The bosonic excitation is regarded as density fluctuation of the fermionic particles around the 
Fermi energy. This is just the bosonization method, which is often used to study quantum 
one-dimensional systems (For example, see [24]). Especially we concentrate only on either of 
the Fermi points. Thus it yields the chiral conformal field theory. 
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We would like to show that the matrix integral form is obtained from the expression (41). First 
we rewrite the summation over partitions as 


LS. 2 = ee a (42) 
A Ay>:>An i>: >éNn # Cir GN 
Then, introducing another variable defined as x; = fi¢;, it can be regarded as a continuous 
variable in the large N limit, 
N—-.o, h—0, hN = O(1) (43) 


This is called ’t Hooft limit. The measure for this variable is given by 


1 
dx; hw N (44) 


Therefore the partition function (41) is rewritten as the following matrix integral, 
Zui) © po Ala) e7 h Lies V(xi) (45) 


Here the matrix potential is derived from the asymptotic behavior of the I-function, 
hlogl(x/h) —> xlogx —~x, h—0 (46) 


Since this variable can take a negative value, the potential term should be simply extended to 
the region of x < 0. Thus, taking into account the fugacity parameter A, the matrix potential 
is given by 
=2/x1 | = | = (47) 
V(x) =2 [x 087 x] 


This is the simplest version of the CP! matrix model [18]. If we start with the partition function 
including the higher Casimir operators (8), the associated integral expression just yields the 
CP! matrix model. 


Let us comment on other possibilities to obtain the matrix model. It is shown that the matrix 
integral form can be derived without taking the large N limit [19]. Anyway one can see that it 
is reduced to the model we discussed above in the large N limit. There is another kind of the 
matrix model derived from the combinatorial partition function by poissonizing the probability 
measure. In this case, only the linear potential is arising in the matrix potential term. Such a 
matrix model is called Bessel-type matrix model, where its short range fluctuation is described 
by the Bessel kernel. 


Next we shall derive the matrix model corresponding to the 6-deformed U(1) model (13). The 
combinatorial part of the partition function is similarly given by 


1 = r(p) NT [iat BU + 8) T(A;-Aj + AG —-i +1) 
Gea Mali {HP 7) icg «TAI Aj+BG—4)) TAi—Aj+BG—71) +1—- 8B) 


(48) 
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In this case we shall introduce the following variables, gl? _ A; + B(N —i) +1 or gl? Ve 


Ai + B(N — i) + B, satisfying @lP = ee) > fB. This means the parameter f characterizes how 
they are exclusive. They satisfy the generalized fractional exclusive statistics for 6 4 1 [25] 
(see also [32]). They are reduced to fermions and bosons for 6B = 1 and f = 0, respectively. 


Then, rescaling the variables, x; = nel? ) the combinatorial part (48) in the ’t Hooft limit yields 


1 oe 
<= AL A(x)? ei Ux, V(x) (a8) 
bh hp (i, j)hP (i,j) 


Here we use I'(w + 8)/T (a) ~ a® with « — oo. The matrix potential obtained here is the same 
as (47). Therefore the matrix model associated with the 6-deformed partition function is given 


by 
ya) 7 [pe A(x)?B e~ h Lins V (3) (50) 
This is just the B-ensemble matrix model shown in (38). 


We can consider the matrix model description of the (q, t)-deformed partition function. In this 
case the combinatorial part of (15) is written as 


1~q —_ a pry —Aj+1 yj-i— 1, et (gh tae (61) 
Gien 1— ted +1eGA) sep GAT co 4 (974) 
_ N (g-Ait+Ajt1y-j+i-1. N 
na pS = (1 ao ae erry = Ila te Se Ty q)e0 
(52) 
AR 


Here (x;9)n = IT, B(1 — xq") is the q-Pochhammer symbol. When we parametrize q = e— 
and t = q?, a set of the variables cgi? yy plays an important role in considering the large N 


limit as well as the 6-deformed model. Thus, rescaling these as x; = ne? ) and taking the ’t 
Hooft limit, we obtain the integral expression of the q-deformed partition function, 


Za) wf Dx (Ap(x))?P e REM Yala) (63) 


The matrix measure and potential are given by 


Ar(x) = I Ss sinh 5 x;) (54) 
Vv -5 [Lin (€8*) — Lip (e-®)] (55) 


We will discuss how to obtain these expressions below. We can see they are reduced to the 
standard ones in the limit of R > 0, 


Ar(x) —> A(x), Vr(x) — V(x) (56) 
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Note that this hyperbolic-type matrix measure is also investigated in the Chern-Simons matrix 
model [35], which is extensively involved with the recent progress on the three dimensional 
supersymmetric gauge theory via the localization method [36]. 


Let us comment on useful formulas to derive the integral expression (53). The measure part is 
relevant to the asymptotic form of the following function, 


(X;q) oo (X3q)oo =(1- x), 


(£X; 4 )oo (tx; q)oo ag 


x —> 0 (57) 


This essentially corresponds to the q — 1 limit of the q-Vandermonde determinant’, 


a2 (x) =P] eo (58) 


ifj (tx; /X)3 4) co 


Then, to investigate the matrix potential term, we now introduce the quantum dilogarithm 


function, so 
g(xq) = IT (1 - <1") (59) 
Its asymptotic expansion is given by (see, eg 19) 
log ¢(x;q = ery = -= 3 Lip_ in (x) om mR)” (60) 


m=0 


where By, is the m-th Bernouilli number, and Lin(x) = Ley xk /k™ is the polylogarithm 
function. The potential term is coming from the leading term of this expression. 


3.3. SU(n) partition function 


Generalizing the result shown in section 3.2, we deal with the combinatorial partition function 
for SU(n) gauge theory (32). Its matrix model description is evolved in [30]. 


The combinatorial factor of the SU(n) partition function (33) can be represented as 


ee O ray’ am) + BG —i)+bim +B) T(B(j —i) + by) 61) 
BAL cajgtmniy TAY) A" + BG =i) + Bim) BUD +P +B) 


where we define parameters as B = —€1/€2, Dim = 4jy,/€2. This is an infinite product 
expression of the partition function. Anyway in this case one can see it is useful to introduce 
n kinds of fermionic variables, corresponding to the n-tupe partition, 


ce) =A + BIN -1) +14, (62) 


? This expression is up to logarithmic term, which can be regarded as the zero mode contribution of the free boson field. 
See [28, 29] for details. 


88 Linear Algebra — Theorems and Applications 


Figure 5. The decomposition of the partition for Z,—3. First suppose the standard correspondence 
between the one-dimensional particles and the original partition, and then rearrange them with respect 
to mod r. 


Then, assuming 0), >> 1, let us introduce a set of variables, 


Cirkae Gi IG ee Gy rey) (63) 
satisfying ¢; > 2 > --- > ¢,n- The combinatorial factor (61) is rewritten with these variables 
as 

1 TGi- oj +B) AY T(-Gi thr +1) 
Zy = (64) 
x en i Rg Ci) lrg, —b-14+ B) 


From this expression we can obtain the matrix model description for SU(n) gauge theory 
partition function, by rescaling x; = h¢; with reparametrizing h = €9, 


Zsu(n) © [Dx A(x)2B @= 8 EI: Vouwn (%) (65) 


In this case the matrix potential is given by 


=a 
A 


Vsu(n) y=2), |« (x — ay) log -(-a)| (66) 


Note that this matrix model is regarded as the U(1) matrix model with external fields a;. We 
will discuss how to extract the gauge theory consequences from this matrix model in section 4. 


3.4. Orbifold partition function 


The matrix model description for the random partition model is also possible for the orbifold 
theory. We would like to derive another kind of the matrix model from the combinatorial 
orbifold partition function (16). We now concentrate on the U(1) orbifold partition function 
for simplicity. See [27, 28] for details of the SU(n) theory. 
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To obtain the matrix integral representation of the combinatorial partition function, we have 
to find the associated one-dimensional particle description of the combinatorial factor. In this 
case, although the combinatorial weight itself is the same as the standard U(1) model, there 
is restriction on its product region. Thus it is useful to introduce another basis obtained by 
dividing the partition as follows, 


{r Cx +N() —i) + uli = peek NO S02 gant = {A;+N-ili=1,---,N} 
(67) 
Fig.5 shows the meaning of this procedure graphically. We now assume N (“) = N for all 
u. With these one-dimensional particles, we now utilize the relation between the orbifold 
partition function and the q-deformed model as discussed in section 2.1. Its calculation is 
quite straightforward, but a little bit complicated. See [27, 28] for details. 


After some computations, we finally obtain the matrix model for the 6-deformed orbifold 
partition function, 


(6) boot rd viel”) 

2 aaa Dz ( ae oe * eb Eit (68) 
In this case, we have a multi-matrix integral representation, since we introduce r kinds of 
partitions from the original partition. The matrix measure and the matrix potential are given 
as follows, 


r—-1 N 
Dz = |] [ax\” (69) 

u=0i=1 

5 r—-1 N 
(oco)” = THT? -2 pe TTT Tae pee go 
u=0 i<j U<O ij 
2 

V(x) = . - — x] (71) 


The matrix measure consists of two parts, interaction between eigenvalues from the same 
matrix and that between eigenvalues from different matrices. Note that in the case of 6B = 1, 
because the interaction part in the matrix measure beteen different matrices is vanishing, this 
multi-matrix model is simply reduced to the one-matrix model. 


4. Large N analysis 


One of the most important aspects of the matrix model is universality arising in the large N 
limit. The universality class described by the matrix model covers huge kinds of the statistical 
models, in particular its characteristic fluctuation rather than the eigenvalue density function. 
In the large N limit, which is regarded as a justification to apply a kind of the mean field 
approximation, anslysis of the matrix model is extremely reduced to the saddle point equation 
and a simple fluctuation around it. 


4.1. Saddle point equation and spectral curve 


Let us first define the prepotential, which is also interpreted as the effective action for the 
eigenvalues, from the matrix integral representation 
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(37), 
F({xj}) = “FEM X;) +23 log( (72) 


i<j 
This is essentially the genus zero part of the prepotential. In the large N limit, in particular 
‘t Hooft limit (43) with Nh = t, we shall investigate the saddle point equation for the matrix 
integral. We can obtain the condition for criticality by differentiating the prepotential, 
1 N 1 
V' (xj) = 2h yi = for all i (73) 
i(Ai) “I 


This is also given by the extremal condition of the effective potential defined as 


N 
Vesa) = Va) — 2h 3) log(x;— x;) (74) 
j(4i) 
This potential involves a logarithmic Coulomb repulsion between eigenvalues. If the ’t Hooft 
coupling is small, the potential term dominates the Coulomb interaction and eigenvalues 


concentrate on extrema of the potential V’(x) = 0. On the other hand, as the coupling gets 
bigger, the eigenvalue distribution is extended. 


To deal with such a situation, we now define the density of eigenvalues, 


1 N 
= »o(x = xi) (75) 


where x; is the solution of the criticality condition (73). In the large N limit, it is natural to think 
this eigenvalue distribution is smeared, and becomes a continuous function. Furthermore, we 
assume the eigenvalues are distributed around the critical points of the potential V(x) as linear 
segments. Thus we generically denote the /-th segment for p(x) as C;, and the total number of 
eigenvalues N splits into n integers for these segments, 


n 
N=)UN (76) 
l=1 


where N, is the number of eigenvalues in the interval C;. The density of eigenvalues p(x) takes 
non-zero value only on the segment C;, and is normalized as 


[, dx p(x) = — =v, (77) 


where we call it filling fraction. According to these fractions, we can introduce the partial ’t 
Hooft parameters, t; = Nh. Note there are n ’t Hooft couplings and filling fractions, but only 
n — 1 fractions are independent since they have to satisfy ))'_, 4; = 1 while all the ’t Hooft 
couplings are independent. 


We then introduce the resolvent for this model as an auxiliary function, a kind of Green 
function. By taking the large N limit, it can be given by the integral representation, 
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w(x) = t [ay pe (78) 


This means that the density of states is regarded as the Hilbert transformation of this resolvent 
function. Indeed the density of states is associated with the discontinuities of the resolvent, 


p(x) = 525, (w(x +e) ~ w(x — ie) (79) 


Thus all we have to do is to determine the resolvent instead of the density of states with 
satisfying the asymptotic behavior, 


i x —> co (80) 


Writing down the prepotential with the density of states, 


F ({xi}) = t [ dxp(x)V(x) PP [ dxdy p(x)ply) log(x — y) (81) 
the criticality condition is given by 


1 
2t 


V'(x) = P [dy pu (82) 


Here P stands for the principal value. Thus this saddle point equation can be also written in 
the following convenient form to discuss its analytic property, 


V' (x) = w(x + ie) + w(x — ie) (83) 


On the other hand, we have another convenient form to treat the saddle point equation, which 
is called loop equation, given by 


(x) — V(x + R(x) =0 (84) 


where we denote 
y(x) = V(x) — 2eo(x) = —20sing (x) (85) 
_ 4t XN v'(x) — V(x) 


ap do 


j=1 


R(x) 


aes (86) 
It is obtained from the saddle point equation by multiplying 1/(x — x;) and taking their 
summation and the large N limit. This representation (84) is more appropriate to reveal its 
geometric meaning. Indeed this algebraic curve is interpreted as the hyperelliptic curve which 
is given by resolving the singular form, 


(x) W(x) =0 (87) 


The genus of the Riemann surface is directly related to the number of cuts of the corresponding 
resolvent. The filling fraction, or the partial ’t Hooft coupling, is simply given by the contour 
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integral on the hyperelliptic curve 


1 


1 => 
' 27i Je, 


ds sing *) =~ gag f, dx yl) (88) 


4.2. Relation to Seiberg-Witten theory 


We now discuss the relation between Seiberg-Witten curve and the matrix model. In the first 
place, the matrix model captures the asymptotic behavior of the combinatorial representation 
of the partition function. The energy functional, which is derived from the asymptotics of the 
partition function [44], in terms of the profile function 


1 x—yJ 3 
E(t) = 4P f_, dxdy "of" wy(a—y? (log (*Z%) - 5) (89) 
Jy<x 
can be rewritten as 
aa dxdy one D> favex vos TT (75 *!) (90) 
up to the perturbative contribution 
1 a; —a 
5 ¥- (a1 — am)* log (5) (91) 
Lm 
by identifying 
n 
— Yo |x —a| = a(x) (92) 
1=1 


Then integrating (90) by parts, we have 


n 


Eq(g) = es dxdy a! (x)o'(y) log(x — y) +2 [ dxol(x) y cE — a1) log € x) et ) 


1=1 
(93) 
This is just the matrix model discussed in section 3.3 if we identify o’(x) = p(x). Therefore 
analysis of this matrix model is equivalent to that of [? ]. But in this section we reconsider the 
result of the gauge theory from the viewpoint of the matrix model. 


We can introduce a regular function on the complex plane, except at the infinity, 


Pa(x) = A" (e” eu?) = A" (w *) (94) 


w 
It is because the saddle point equation (83) yields the following equation, 
ey (x+ie)/2 +4 ey (xt+ie)/2 _ ey (x—ie) /2 + ey (x—ie)/2 (95) 


This entire function turns out to be a monic polynomial P,(x) = x" + ---, because it is an 
analytic function with the following asymptotic behavior, 
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n 
Ate/2 = ate eo) € = “) ay gl x —> 00 96 
II x (96) 
Here w should be the smaller root with the boundary condition as 
At 
ne x —> 00 (97) 
thus we now identify 
w=e 9? (98) 


Therefore from the hyperelliptic curve (94) we can relate Seiberg-Witten curve to the spectral 
curve of the matrix model, 

1 Pi 

2ni w 


1 
= 99 
ona log w dx (99) 


1 

= anil 42 

Note that it is shown in [37, 38] we have to take the vanishing fraction limit to obtain the 
Coulomb moduli from the matrix model contour integral. This is the essential difference 
between the profile function method and the matrix model description. 


4.3. Eigenvalue distribution 


We now demonstrate that the eigenvalue distribution function is indeed derived from the 
spectral curve of the matrix model. The spectral curve (94) in the case of n = 1 with setting 
A = Land P,,—1(x) = x is written as 


1 
x=wt+ = (100) 


From this relation the singular part of the resolvent can be extracted as 
Wsing(x) = arccosh (5) (101) 


This has a branch cut only on x € [—2,2], namely a one-cut solution. Thus the eigenvalue 
distribution function is witten as follows at least on x € [—2,2], 


p(x) = ~ arceos (5) (102) 


Note that this function has a non-zero value at the left boundary of the cut, o(—2) = 1, while at 
the right boundary we have (2) = 0. Equivalently we now choose the cut of arccos function 
in this way. This seems a little bit strange because the eigenvalue density has to vanish except 
for on the cut. On the other hand, recalling the meaning of the eigenvalues, i.e. positions of 
one-dimensional particles, as shown in Fig. 4, this situation is quite reasonable. The region 
below the Fermi level is filled of the particles, and thus the density has to be a non-zero 
constant in such a region. This is just a property of the Fermi distribution function. (1/N 
correction could be interpreted as a finite temperature effect.) Therefore the total eigenvalue 
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p(x) 
1 


Figure 6. The eigenvalue distribution function for the U(1) model. 


distribution function is given by 


1 x<-2 
p(x) = 4 darccos(%) |x| <2 (103) 
x>2 


Remark the eigenvalue density (103) is quite similar to the Wigner’s semi-circle distribution 
function, especially its behavior around the edge, 


1 x\2 1 
Peire(*) = = (5) Move, #52 (104) 


The fluctuation at the spectral edge of the random matrix obeys Tracy-Widom distribution 
[56], thus it is natural that the edge fluctuation of the combinatorial model is also described 
by Tracy-Widom distribution. This remarkable fact was actually shown by [9]. Evolving such 
a similarity to the gaussian random matrix theory, the kernel of this model is also given by the 
following sine kernel, 


sin po7t(x — y) 
K(x, y) = ————_——+ (105) 
= Tey) 
where o is the averaged density of eigenvalues. This means the U(1) combinatorial model 
belongs to the GUE random matrix universal class [40]. Then all the correlation functions can 
be written as a determinant of this kernel, 
p(x1,-++ ,X,) = det [K(xi,))| (106) 


1Si,j,<k 


Let us then remark a relation to the profile function of the Young diagram. It was shown 
that the shape of the Young diagram goes to the following form in the thermodynamical limit 


[33, 58, 59], 
2 (xaresin § +V4— x?) |x| <2 


O(x) = 
o |x| |x| > 2 


(107) 


Rather than this profile function itself, the derivative of this function is more relevant to our 
study, 
-1 x<-2 
Of(x) = ¢ 2 aresin(#) |x| <2 (108) 
1 x>2 
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One can see the eigenvalue density (103) is directly related to this derivative function (108) as 


p(x) = os (109) 


This relation is easily obtained from the correspondence between the Young diagram and the 
one-dimensional particle as shown in Fig. 4. 


5. Conclusion 


In this article we have investigated the combinatorial statistical model through its matrix 
model description. Starting from the U(1) model, which is motivated by representation 
theory, we have dealt with its 6-deformation and q-deformation. We have shown that 
its non-Abelian generalization, including external field parameters, is obtained as the four 
dimensional supersymmetric gauge theory partition function. We have also referred to the 
orbifold partition function, and its relation to the q-deformed model through the root of unity 
limit. 

We have then shown the matrix integral representation is derived from such a combinatorial 
partition function by considering its asymptotic behavior in the large N limit. Due to variety of 
the combinatorial model, we can obtain the 6-ensemble matrix model, the hyperbolic matrix 
model, and those with external fields. Furthermore from the orbifold partition function the 
multi-matrix model is derived. 


Based on the matrix model description, we have study the asymptotic behavior of the 
combinatorial models in the large N limit. In this limit we can extract various important 
properties of the matrix model by analysing the saddle point equation. Introducing the 
resolvent as an auxiliary function, we have obtained the algebraic curve for the matrix model, 
which is called the spectral curve. We have shown it can be interpreted as Seiberg-Witten 
curve, and then the eigenvalue distribution function is also obtained from this algebraic curve. 


Let us comment on some possibilities of generalization and perspective. As discussed in this 
article we can obtain various interesting results from Macdonald polynomial by taking the 
corresponding limit. It is interesting to research its matrix model consequence from the exotic 
limit of Macdonald polynomial. For example, the q — 0 limit of Macdonald polynomial, 
which is called Hall-Littlewood polynomial, is not investigated with respect to its connection 
with the matrix model. We also would like to study properties of the BC-type polynomial 
[31], which is associated with the corresponding root system. Recalling the meaning of the 
q-deformation in terms of the gauge theory, namely lifting up to the five dimensional theory 
R* x s! by taking into account all the Kaluza-Klein modes, it seems interesting to study the six 
dimensional theory on R* x T?. In this case it is natural to obtain the elliptic generalization 
of the matrix model. It can not be interpreted as matrix integral representation any longer, 
however the large N analysis could be anyway performed in the standard manner. We would 
like to expect further develpopment beyond this work. 
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1. Introduction 


Nonnegative matrices have long been a sorce of interesting and challenging mathematical 
problems. They are real matrices with all their entries being nonnegative and arise in a number 
of important application areas: communications systems, biological systems, economics, 
ecology, computer sciences, machine learning, and many other engineering systems. Inverse 
eigenvalue problems constitute an important subclass of inverse problems that arise in the 
context of mathematical modeling and parameter identification. A simple application of such 
problems is the construction of Leontief models in economics [1]-[3]. 


The nonnegative inverse eigenvalue problem (NIEP) is the problem of characterizing those lists 
A = {A1,A2,...,An} of complex numbers which can be the spectra of n x n entrywise 
nonnegative matrices. If there exists a nonnegative matrix A with spectrum A we say that 
A is realized by A and that A is the realizing matrix. A set K of conditions is said to be a 
realizability criterion if any list A = {Aj,A2,...,An}, real or complex, satisfying conditions K 
is realizable. The NIEP is an open problem. A full solution is unlikely in the near future. The 
problem has only been solved for n = 3 by Loewy and London ([4], 1978) and for n = 4 
by Meehan ([5], 1998) and Torre-Mayo et al.([6], 2007). The case n = 5 has been solved for 
matrices of trace zero in ([7], 1999). Other results, mostly in terms of sufficient conditions 
for the problem to have a solution (in the case of a complex list A), have been obtained, in 
chronological order, in [8]-[13]. 


Two main subproblems of the NIEP are of great interest: the real nonnegative inverse eigenvalue 
problem (RNIEP), in which A is a list of real numbers, and the symmetric nonnegative inverse 
eigenvalue problem (SNIEP), in which the realizing matrix must be symmetric. Both problems, 
RNIEP and SNIEP are equivalent for n < 4 (see [14]), but they are different otherwise (see 
[15]). Moreover, both problems remains unsolved for n > 5. The NIEP is also of interest 
for nonnegative matrices with a particular structure, like stochastic and doubly stochastic, 
circulant, persymmetric, centrosymmetric, Hermitian, Toeplitz, etc. 


The first sufficient conditions for the existence of a nonnegative matrix with a given real 
spectrum (RNIEP) were obtained by Suleimanova ([16], 1949) and Perfect ([17, 18], 1953 and 
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1955). Other sufficient conditions have also been obtained, in chronological order in [19]-[26], 
(see also [27, 28], and references therein for a comprehensive survey). 


The first sufficient conditions for the SNIEP were obtained by Fiedler ([29], 1974). Other results 
for symmetric realizability have been obtained in [8, 30] and [31]-[33]. Recently, new sufficient 
conditions for the SNIEP have been given in [34]-[37]. 


1.1. Necessary conditions 


Let A be a nonnegative matrix with spectrum A = {A1,A9,...,An}. Then, from the Perron 
Frobenius theory we have the following basic necessary conditions 


@\ A= eal SK 
(2) maxj{|Aj]} © A (1) 
(3) sm(A) = i Aj >0,m=1,2,..., 


where A = A means that A is closed under complex comjugation. 
Moreover, we have 


(4) (sp(A))™ < n"—18p,(A), k,m =1,2,... (2) 
(5) (so(A))? < (n—1)sq(A), 1 odd, tr(A) = 0. 


Necessary condition (4) is due to Loewy and London [4]. Necessary condition (5), which 
is a refinement of (4), is due to Laffey and Meehan [38]. The list A = {5,4,—3, —3, —3} for 
instance, satisfies all above necessary conditions, except condition (5). Therefore A is not a 
realizable list. In [39] it was obtained a new necessary condition, which is independent of the 
previous ones. This result is based on the Newton’s inequalities associated to the normalized 
coefficients of the characteristic polynomial of an M-matrix or an inverse M-matrix. 


The chapter is organized as follows: In section 2 we introduce two important matrix results, 
due to Brauer and Rado, which have allowed to obtain many of the most general sufficient 
conditions for the RNIEP, the SNIEP and the complex case. In section 3 we consider the real 
case and we introduce, without proof (we indicate where the the proofs can be found), two 
sufficient conditions with illustrative examples. We consider, in section 4, the symmetric case. 
Here we introduce a symmetric version of the Rado result, Theorem 2, and we set, without 
proof (see the appropriate references), three sufficient conditions, which are, as far as we know, 
the most general sufficient conditions for the SNIEP. In section 5, we discuss the complex (non 
real) case. Here we present several results with illustrative examples. Section 6 is devoted to 
discuss some Fiedler results and Guo results, which are very related with the problem and 
have been employed with success to derive sufficient conditions. Finally, in section 7, we 
introduce some open questions. 


2. Brauer and Rado Theorems 

A real matrix A = (a;;)'"_, is said to have constant row sums if all its rows sum up to the same 
n 

constant, say, a, that is, iy ayy = &, i = 1,...,n. The set of all real matrices with constant 


j=l 
row sums equal to « is denoted by CS,. It is clear that any matrix in CS, has eigenvector 
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e= (11s, 1)T corresponding to the eigenvalue a. Denote by e, the n—dimensional vector 
with one in the k — th position and zeros elsewhere. 


It is well known that the problem of finding a nonnegative matrix with spectrum A = 
{A1,---,An} is equivalent to the problem of finding a nonnegative matrix in CS,, with 
spectrum A (see [40]). This will allow us to exploit the advantages of two important theorems, 
Brauer Theorem and Rado Theorem, which will be introduced in this section. 


The spectra of circulant nonnegative matrices have been characterized in [9], while in [10], 
a simple complex generalization of Suleimanova result has been proved, and efficient and 
general sufficient conditions for the realizability of partitioned spectra, with the partition 
allowing some of its pieces to be nonrealizable, provided there are other pieces, which are 
realizable and, in certain way, compensate the nonnrealizability of the former, have been 
obtained. This is the procedure which we call negativity compensation. This strategy, based in 
the use of the following two perturbation results, together with the properties of real matrices 
with constant row sums, has proved to be successful. 


Theorem 1. Brauer [41] Let A be an n x n arbitrary matrix with eigenvalues Ay,...,An. Let 
v = (v4,...,0n)! an eigenvector of A associated with the eigenvalue Ay and let q = (q1,+--,Gn)? 
be any n-dimensional vector. Then the matrix A + vq! has eigenvalues A4,...,Ap—1,An + 
vg, Agi ++ An- 


Proof. Let U be ann X n nonsingular matrix such that 


Ay ect 

u-!AU = Ag 
ok 
An 


is an upper triangular matrix, where we choose the first column of U as v (U there exists from 
a well known result of Schur). Then, 
Ay t+ q'v Kore Ok 

91 92°°* Gn ee 
u-'(A+vq')U=U !AU+ u= A2 
ox 

An 

and the result follows. m This proof is due to Reams [42]. 
Theorem 2. Rado [18] Let A be an n x n arbitrary matrix with eigenvalues Ay,...,Ay and let 
O. = diag{Ay,...,Ar} for some r < n. Let X be ann x r matrix with rank r such that its columns 
X1,X2,...,Xr satisfy Ax; = Ajxj,i = 1,...,r. Let C bean r x n arbitrary matrix. Then the matrix 
A+ XC has eigenvalues ty,..-, Hy, Apsa,--+,An, Where Hy,..., pt, are eigenvalues of the matrix QO. + 
CX. 


Proof. Let S = [X | Y] a nonsingular matrix with sis Ee Then UX = I,, VY = In_+ and 
VX =0,UY =0.Let C = [Cy | Co], X = [Xt], Y = [}]. Then, since AX = XO, 


QO Ve 


SAS = H [XQ | AY] = k vay 


102 Linear Algebra — Theorems and Applications 


and 


: mate = quits: 1 AT, | ~ 
s xes = |f] [| cals =| é|[zul-[% 3 | 
Thus, 


S(A+XC)S =S1AS +S XCS = ik + CX MUAY + al 


0 VAY 
and we have 0(A + XC) = 0(Q+CX)+0(A)—o(Q). » 


3. Real nonnegative inverse eigenvalue problem. 


Regarding the RNIEP, by applying Brauer Theorem and Rado Theorem, efficient and general 
sufficient conditions have been obtained in [18, 22, 24, 36]. 


Theorem 3. [24] Let A = {A1,Ag,...,An} be a given list of real numbers. Suppose that: 
i) There exists a partition A = Aq U...U A¢, where 


Ag = {Ag Anas Akp tb) Att = An, At 2+ 2 Akpy An 2 9, 
k =1,...,t, such that for each sublist A; we associate a corresponding list 
Ty = { Wk, Akay 1 Arp, by 0 < Wk < M1, 


which is realizable by a nonnegative matrix Ax € CSy, of order pr. 

ii) There exists a nonnegative matrix B € CS), with eigenvalues A1,Ap1,..., An (the first elements of 
the lists A;) and diagonal entries w1,W,..., w+ (the first elements of the lists T,). 

Then A is realizable by a nonnegative matrix A € CS,,. 


Perfect [18] gave conditions under which 1, A2,...,Az and w1,W2,...,W+ are the eigenvalues 
and the diagonal entries, respectively, of a t x t nonnegative matrix B € CS),,. For t = 2 it is 
necessary and sufficient that Ay + Az = wy + w2, with O < w; < Ay. For t = 3 Perfect gave the 
following result: 


Theorem 4. [18] The real numbers Aj, A2,A3 and wy}, W,W3 are the eigenvalues and the diagonal 
entries, respectively, of a3 x 3 nonnegative matrix B € CS,,, if and only if: 


i Die Gy, cai 
ii) AyHAg+tA3 =W1 +02+03 


is 3 
iii) AyA2 + A4A3 + A2A3 < WyW2 + W1W3 + W2W3 (3) 
iv) max,wr > Ag 
Then, an appropriate 3 x 3 nonnegative matrix B is 
Wy 0 Ay Wy 
B=|]Aj-wo-p wy p j (4) 
0 Ay _—W3 W3 


where 
1 


a 


(ww + w1wW3 + W2W3 — AyA2 + AYA3 + Azz). 
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For t > 4, we only have a sufficient condition: 


i) O< up < Ay, K=1,2,...4, 
ii) Wy +W2-+>+w@p=A,+Ag--++At, (5) 
iii) Wk => Xz, Wy = Ak, k= 2,3, 00098) 


with the following matrix B € CS), having eigenvalues and diagonal entries A1,A2,...,At 
and w1,W2,..., Wt, respectively: 


= - & ; o) 
ee, ee ean 
Example 1. Let us consider the list A = {6,1,1, —4, —4} with the partition 
Ay 246, aby Koha}, Ay = Eh 
and the realizable associated lists 
T, = {4,-4}, To = {4,-4}, T3 = {0}. 


From (4) we compute the 3 x 3 nonnegative matrix 


with eigenvalues 6,1, 1, and diagonal entries 4,4,0. Then 


04000 100 
40000 100] [00002 
A= |00040/+/010] | 30003 
00400 010] [00600 
00000 001 
04002 
40002 
ieee 
2 2 
00600 


is nonnegative with spectrum A. 


A map of sufficient conditions for the RNIEP it was constructed in [28], There, the sufficient 
conditions were compared to establish inclusion or independence relations between them. It 
is also shown in [28] that the criterion given by Theorem 3 contains all realizability criteria 
for lists of real numbers studied therein. In [36], from a new special partition, Theorem 3 
is extended. Now, the first element Ax, of the sublist A, need not to be nonnegative and 
the realizable auxiliar list Ty = {wx,Ag, Akp contains one more element. Moreover, the 
number of lists of the partition depend on the number of elements of the first list A,, and some 
lists A; can be empty. 
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Theorem 5. [36] Let A = {Ay,Az,...,An} be a list of real numbers and let the partition A = 
Ay U+++U Ay, +1 be such that 


Ay = {Ag Anas Akp ds Ann = Av An 2 Age 2+ 2 Ap, 
k =1,...,p, +1, where py, is the number of elements of the list A, and some of the lists Ax can be 
empty. Let wo,...,Wp,+1 be real numbers satisfying 0 < we < A1,k = 2,...,p1 +1. Suppose that 
the following conditions hold: 
i) For eachk = 2,...,p, +1, there exists a nonnegative matrix Ay € CS, with spectrum Ty = 


{WK Akty er Arey by 
ii) There exists a py X p, nonnegative matrix B € CS), with spectrum A and with diagonal entries 


W2,..- Wy +1- 
Then A is realizable by a nonnegative matrix A € CS),. 


Example 2. With this extension, the authors show for instance, that the list 
{5,4,0, —3, —3, —3} 
is realizable, which can not be done from the criterion given by Theorem 3. In fact, let the partition 
Ay = {5,4,0,—-3}, Ao = {-3}, Ag = {-3} with 
T, = {3,—3}, T3 = {3,-3}, Tg =T5 = {0}. 
The nonnegative matrix 
3020 
0302 


3002 
0320 


has spectrum A, and diagonal entries 3,3,0,0. It is clear that 


B= 


Az = A3 = 3 | realizes T7 = T3. 


Then 
1000 030020 
Ao 1000 000020 300020 
co A3 4 0100 000002 _ 000302 
0 0100 300002 003002 
0 0010 003020 300002 
0001 003020 


has the desired spectrum {5,4,0, —3, —3, —3}. 


4. Symmetric nonnegative inverse eigenvalue problem 


Several realizability criteria which were first obtained for the RNIEP have later been shown 
to be symmetric realizability criteria as well. For example, Kellogg criterion [19] was showed 
by Fiedler [29] to imply symmetric realizability. It was proved by Radwan [8] that Borobia’s 
criterion [21] is also a symmetric realizability criterion, and it was proved in [33] that Soto’s 
criterion for the RNIEP is also a criterion for the SNIEP. In this section we shall consider the 
most general and efficient symmetric realizability criteria for the SNIEP (as far as we know 
they are). We start by introducing a symmetric version of the Rado Theorem:. 
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Theorem 6. [34] Let A be ann x n symmetric matrix with spectrum A = {A1,Az,...,An} and for 
some r <n, let {x1,x2,...,Xr} be an orthonormal set of eigenvectors of A spanning the invariant 
subspace associated with Ay,Az,...,Ar. Let X be the n x r matrix with i — th column x;, let OQ = 
diag{A,,...,A;y}, and let C be any r x r symmetric matrix. Then the symmetric matrix A + XCX™ 
has eigenvalues ty, [oy + +) Ups Artis-++,An, Where fly, ly,..-,H, are the eigenvalues of the matrix 
O+C. 


Proof. Since the columns of X are an orthonormal set, we may complete X to an orthogonal 
matrix W=(X Y]/thatis, X7X = 1, YY = f+, XY =0,Y"X =0. Then 


= xT Q XTAY 
1 = = 
w-lAw = yr Ax Y| = i vray | 
Z i CO 
1 T — r = 
w-'(xcx!)w = Age = ole 
Therefore, 
-1 Tin | Q+C XTAY 
Ww l(A+ xcx!)Ww= a ae 


and A+ XCX? is symmetric with eigenvalues }11,...,H,,Ap41,---,An- 
By using Theorem 6, the following sufficient condition was proved in [34]: 


Theorem 7. [34] Let A = {A,A2,...,An} bea list of real numbers with Ay > Ay > +++ > Ay and, 
for some t <n, let wy ,...,+ be real numbers satisfying 0 < wp < A1,k =1,...,t. Suppose there 
exists: 

i) a partition A = A,U---U A; with 


Ag = {Ag Atay Akp dt, Ann = Av A 20, Ags 2 Aka 2 ++ > Akpy, 


such that for each k = 1,...,t, the list Ty = {wp Apo, Akp is realizable by a symmetric 
nonnegative matrix Aj, of order px, and 

ii) a t x t symmetric nonnegative matrix B with eigenvalues A41,A1,...,Az,} and with diagonal 
entries W1,W2,...,Wt.- 

Then A is realizable by a symmetric nonnegative matrix. 


Proof. Since Ag is a pe X pe Symmetric nonnegative matrix realizing T;, then A = 
diag{A,,Az,...,A+} is symmetric nonnegative with spectrum Ty UT? U--- UT;. Let 
{x1,...,x¢} be an orthonormal set of eigenvectors of A associated with w},...,wt, 
respectively. Then the n x t matrix X with i — th column x; satisfies AX = XQ for 
OQ. = dig{w ,...,wt}. Moreover, X is entrywise nonnegative, since each x; contains the 
Perron eigenvector of A; and zeros. Now, if we set C = B — Q, the matrix C is symmetric 
nonnegative and © + C has eigenvalues Ay,...,A;. Therefore, by Theorem 6 the symmetric 
matrix A + XCXT has spectrum A. Besides, it is nonnegative since all the entries of A, X, and 
C are nonnegative. = 


Theorem 7 not only ensures the existence of a realizing matrix, but it also allows to construct 
the realizing matrix. Of course, the key is to know under which conditions does there exists 
a t x t symmetrix nonnegative matrix B with eigenvalues A1,...,A; and diagonal entries 
W1,.++,Wt. 
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The following conditions for the existence of a real symmetric matrix, not necessarily 
nonnegative, with prescribed eigenvalues and diagonal entries are due to Horn [43]: There 


exists a real symmetric matrix with eigenvalues 41 > Ay > -++ > At and diagonal entries 
W, > wW2 >--+ > wo; if and only if 


k k 
Vare wp k= 1.gt= 1 
ae aa 

a => Ye 0; 

i=1 i=1 


For ft = 2, the conditions (7) become 


(7) 


Aya, AV+AQ = 1 +02, 


and they are also sufficient for the existence of a 2 x 2 symmetric nonnegative matrix B with 
eigenvalues A; > Az and diagonal entries w 1 > w2 > 0, namely, 


B= wy V (Aq — @1)(A1 — 2) 
V(A1 — @1) (Ar — 2) w2 


For t = 3, we have the following conditions: 


Lemma 1. [29] The conditions 


Ay 2 wy 
Ay t+Az2 > WwW, +2 
Ay tAg+A3 = 01, +02+03 
Ww, 2 Ag 


(8) 
are necessary and sufficient for the existence of a 3 x 3 symmetric nonnegative matrix B with 
eigenvalues A, > Ay > Az and diagonal entries wy > wz > w3 > 0. 


In [34], the following symmetric nonnegative matrix B, satisfying conditions (8), it was 


constructed: 
H=W3 H—W2 
oa \/ 2i—a@n—w3 > Vans 
A — WW) te. = Ae 
B= \/ Wg a a8 Ww2 Vu _ w2) (Ht > w3) y (9) 


a ae VE Ge) w3 


where pf = Ay + Ag — wy; 8 = (Ar — w)(A1 — 1). 
For t > 4 we have only a sufficient condition: 


Theorem 8. Fiedler [29] If A, > --- > Ag and wy, > +--+ > w+ satisfy 


i) yA > wi, s=1,...,t-1 
ae ae 
, 10 
ii) Vai = Vi 0; a) 
i=1 i=1 


iti) wy_y > Ap, K=2,...,t-1 
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then there exists at x t symmetric nonnegative matrix with eigenvalues A1,...,A¢ and diagonal entries 
W1,..+,Wt. 


Observe that 


w 

II 
NIBNIF NOI 
NIRNIF O71 NO 
NO OINeENIFE 
Ol NNBENIFE 


has eigenvalues 8,6,3,3, but Az =6 >5 =a}. 

Example 3. Let us consider the list A = {7,5,1, —3, —4, —6} with the partition 
Ay = {7, —6}, Ao = {5, Al, A3 = {L —3} with 
T, _ {6, —6}, T> = {4, A}, T3 = {3, —3}. 


We look for a symmetric nonnegative matrix B with eigenvalues 7,5,1 and diagonal entries 6, 4,3. 
Then conditions (8) are satisfied and from (9) we compute 


3/2 

6 3 5 
B= TE 4 6 | andC=B-Q, 
2 


/3 ve 3 


where O. = diag{6,4,3}. The symmetric matrices 


realize T,,12,13. Then 


2 

ae 0 

2 

a 
sa 0 2 9 

A= Ap + XCX!, where X = 4 , 
0 2 0 
A3 2 
00¥ 
00¥ 


is symmetric nonnegative with spectrum A. 
In the same way as Theorem 3 was extended to Theorem 5 (in the real case), Theorem 7 was 
also extended to the following result: 


Theorem 9. [36] Let A = {A1,Az,...,An} be a list of real numbers and let the partition A = 
Ay U+++U Ay, 4) be such that 


Ak = {An Aka Apt) Att = Ad, Ant 2 Aka 2-0 = Akpy, 


k=1,...,p, +1, where A, is symmetrically realizable, p, is the number of elements of Ay and some 
lists Ax can be empty. Let w,...,Wp,+41 be real numbers satisfying 0 < wp < Aq, k = 2,...,p1 +1. 
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Suppose that the following conditions hold: 

i) For each k = 2,...,p, +1, there exists a symmetric nonnegative matrix A; with spectrum T, = 
{Wy Akty er Ady by 

ii) There exists a py x p, symmetric nonnegative matrix B with spectrum A, and with diagonal entries 
W2,+.. Wy +1- 

Then A is symmetrically realizable. 


Example 4. Now, from Theorem 9, we can see that there exists a symmetric nonnegative matrix 
with spectrum A = {5,4,0, —3,—3, —3}, which can not be seen from Theorem 7. Moreover, we can 
compute a realizing matrix. In fact, let the partition 


Ay = {5,4,0,-3}, Ap = {—3}, Ag = {—3} with 
T = {3,-3}, Ts = (3, -3}, Ty =T's = {0}. 


The symmetric nonnegative matrix 


has spectrum A, and diagonal entries 3,3,0,0. Let O. = diag{3,3,0,0} and 


0 00 


0 00 


v2 90 

2 , Az = Ag 
2 

Y 00 

010 


001 


ocoo°o SNS 


Then, from Theorem 6 we obtain 


which is symmetric nonnegative with spectrum A. 


The following result, although is not written in the fashion of a sufficient condition, is indeed 
a very general and efficient sufficient condition for the SNIEP. 


Theorem 10. [35] Let A be an n x n irreducible symmetric nonnegative matrix with spectrum 
A = {Aj,A2,...,An}, Perron eigenvalue A1 and a diagonal element c. Let B be an m x m symmetric 
nonnegative matrix with spectrum T = {f1y,My,--+, 4, and Perron eigenvalue p1,. 

i) If uy <c, then there exists a symmetric nonnegative matrix C, of order (n +m — 1), with spectrum 
{Aine xg Ans Horses sia he 
ii) If 4 = c, then there exists a symmetric nonnegative matrix C, of order (n +m —1), with spectrum 
{Aq + fy — Cp A2, Ans Hore Hig} 
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Example 5. The following example, given in [35], shows that 
{7,5,0,—4, —4, —4} with the partition 
A = {7,5,0,—4}, T = {4,-4}, 


satisfies conditions of Theorem 10, where 


40 b 0 

_ |e eo a ie, tee 2 

A= bowie with b2 +d? = 23, bd =4V6, 
0d v6 0 


is symmetric nonnegative with spectrum A. Then there exists a symmetric nonnegative matrix C 
with spectrum {7,5,0, —4, —4} and a diagonal element 4. By applying again Theorem 10 to the lists 
{7,5,0, —4, —4} and {4, —4}, we obtain the desired symmetric nonnegative matrix. 


It is not hard to show that both results, Theorem 9 and Theorem 10, are equivalent (see [44)). 
Thus, the list in the Example 4 is also realizable from Theorem 10, while the list in the example 
5 is also realizable from Theorem 9. 


5. List of complex numbers 


In this section we consider lists of complex nonreal numbers. We start with a complex 
generalization of a well known result of Suleimanova, usually considered as one of the 
important results in the RNIEP (see [16]): The list Ay > 0 > Az > --- > Ay is the spectrum of a 
nonnegative matrix if and only if Ay + Az +---+An > 0. 


Theorem 11. [10] Let A = {Ap,Ay,...,An} be a list of complex numbers closed under complex 
conjugation, with 


A! = {Ay,...,An} C {2 €C: Rez <0; |Rez| > |Imz|}. 


n 
Then A is realizable if and only if sy Aj = 0. 
i=0 


Proof. Suppose that the elements of A! are ordered in such a way that A2p+ ...,Ay are real 
and A1,..., A2p are complex nonreal, with 


Xp = ReAnp_1 = ReAy, and y, = ImAy,_1 = IMA, 
fork = 1,...,p. Consider the matrix 


0 0 0 
—Xy ty 1 -Y1 
—y Wy M1 


B= |—Xp+¥Yp 0 OO . Xp —Yp 


—Xp—Yp 0 O . Yp Xp 


—Aspi1 O O. Aop+i 


—hyn 0 ‘ An 
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It is clear that B € CSo with spectrum {0,A1,...,An} and all the entries on its first column are 


n 
nonnegative. Define q = (q0,91,---,4n)! with qo = Ag + yo Aj and 
i=1 


9x = — RedAg fork =1,...,2p and qx = —Ax fork = 2p+1,...,n. 
Then, from the Brauer Theorem 1 A = B + eq! is nonnegative with spectrum A. m 


In the case when all numbers in the given list, except one (the Perron eigenvalue), have real 
parts smaller than or equal to zero, remarkably simple necessary and sufficient conditions 
were obtained in [11]. 


Theorem 12. [11] Let Ay, A3,...,An be nonzero complex numbers with real parts less than or equal to 
zero and let Aj be a positive real number. Then the list A = {Az,A,...,An} is the nonzero spectrum 
of a nonnegative matrix if the following conditions are satisfied: 


i) A=A 
n 
ii) 35 =) A; >0 
dL ; (11) 
n 
iii) sy =) A? >0 
i=l 


The minimal number of zeros that need to be added to A to make it realizable is the smallest nonnegative 
integer N for which the following inequality is satisfied: 


s < (n+ N)so. 


Furthermore, the list {Ay,Az,...,An,0,...,0} can be realized by C + x1, where C is a nonnegative 
companion matrix with trace zero, « is a nonnegative scalar and I is the n x n identity matrix. 


Corollary 1. [11] Let Az, A3,...,An be complex numbers with real parts less than or equal to zero and 
let A, be a positive real number. Then the list A = {A1,Az,...,An} is the spectrum of a nonnegative 
matrix if and only if the following conditions are satisfied: 


i) A=A 
ii) $1 = Le A; >0 
es (12) 


Example 6. The list A = {8,-—1+ 3i,—1— 3i,—2+ 5i,—2 — 5i} satisfies conditions (12). Then A 
is the spectrum of the nonnegative companion matrix 


0 1 0 00 
0 0O 1 00 
C= 0 0 010 
0 0 001 


2320 494 278 12 
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Observe that Theorem 11 gives no information about the realizability of A. 

The list {19,-1+ 11i,-1—11i, —3 + 8i,-—3 — 8} was given in [11]. It does not satisfy conditions 
(12): 8; = 11,82 = 1land s? < nso. The inequality 117 < (5+ N)11 is satisfied for N > 6. Then 
we need to add 6 zeros to the list to make it realizable. 


Theorem 3 (in section 3), can also be extended to the complex case: 
Theorem 13. [13] Let A = {Ao,A3,...,An} be a list of complex numbers such that A = A, Ay > 
n 
max; |A;|,i = 2,...,n,and om A; > 0. Suppose that: 
z= 1. 
i) there exists a partition A = Ay U---U Ay with 
Ag = {An Ana» Akp Ant = An, 


k =1,...,t, such that Ty = {wp Aga, Xp t is realizable by a nonnegative matrix Ay € CSa, 
and 

ii) there exists at x t nonnegative matrix B € CS),, with eigenvalues 

Ay, A21,-+.,Agy (the first elements of the lists Aj.) and with diagonal entries w 1, w2,..., cw» (the Perron 
eigenvalues of the lists T,.). 

Then A is realizable. 


Example 7. Let A = {7,1, —2,—2,—2 + 4i, —2 — 4i}. Consider the partition 
Aq = {7,1,-2,-2}, Ao = {-2+4i}, Ag ={-2—44} with 
T, = {3,1,-—2,-2}, T2 = {0}, T3 = {0}. 


We look for a nonnegative matrix B € CS7 with eigenvalues 7, —2 + 4i, —2 — 4i and diagonal entries 
3,0,0, and a nonnegative matrix A, realizing 11. They are 


3 04 2001 
_ 41 8 = 
B= a and A, = 0102 
0120 
Then 
100 020104 
At 100) Py annga 200104 
100) Ia i 010204 
A= 0 | + 200008 |= 

; ere 012004 
010 400008 
001 000070 


has the spectrum A. 


6. Fiedler and Guo results 


One of the most important works about the SNIEP is due to Fiedler [29]. In [29] Fiedler 
showed, as it was said before, that Kellogg sufficient conditions for the RNIEP are also 
sufficient for the SNIEP. Three important and very useful results of Fiedler are: 
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Lemma 2. [29] Let A be a symmetric m x m matrix with eigenvalues a1,...,m, AU = «1, ||u|| = 
1. Let B be a symmetric n x n matrix with eigenvalues B,,...,B,,, Bv = B1v, ||v|| = 1. Then for any 


p, the matrix 
_f[ A puv? 
— Ee B | 


has eigenvalues w2,..., 0m, Bo, -++1Byr Vis Yo, Where 1, V2 are eigenvalues of the matrix 


a} e 
C= : 
| p | 
Lemma 3. [29] If {a1,...,4m} and {B,,...,,,} are lists symmetrically realizable and x, > B,, 
then for any t > O, the list 


{a1 +t, By — t,02,..., 0m, Bo,---, By, 
is also symmetrically realizable. 


Lemma 4. [29] If A = {A1,A2,...,An} is symmetrically realizable by a nonnegative matrix and if 
t > 0, then 
Ap = {Ay +4,A2,...,An} 


is symmetrically realizable by a positive matrix. 


Remark 1. It is not hard to see that Lemma 2 can be obtained from Theorem 6. In fact, it is enough to 


consider 
_[A u0] [0p] fut oF 
c= |*5]+[o0] [6] [oro 


_[ A puv? 
~ fovul B |’ 


which is symmetric with eigenvalues ¥1,V2,%2,--+, 0m, Bo,+++1 By, Where V1, V2 are eigenvalues of 


Now we consider a relevant result due to Guo [45]: 


Theorem 14. [45] If the list of complex numbers A = {A1,A2,...,An} is realizable, where A, is the 
Perron eigenvalue and Az € R, then for any t > 0 the list Ap = {Ay + t,A2 £t,Az,...,An} is also 
realizable. 


n 
Corollary 2. [45] If the list of real numbers A = {A1,A2,...,An} is realizable and ty = = |t;| with 
1=2 


t; € R,i=2,...,n, then the list Ap = {Aq + t1,A2 + to,...,An + tr} is also realizable. 


Example 8. Let A = {8,6,3,3,—5,—5,—5,—5} be a given list. Since the lists Ay = Ag = 
{7,3, —5, —5} are both realizable (see [22] to apply a simple criterion, which shows the realizability 
of A, = Ad), then 

Ay UA: = {7,7,3,3, —5, —5, —5, —5} 


is also realizable. Now, from Theorem 14, with t = 1, A is realizable. 


Nonnegative Inverse Eigenvalue Problem 


Guo also sets the following two questions: 

Question 1: Do complex eigenvalues of nonnegative matrices have a property similar to 
Theorem 14? 

Question 2: If the list A = {A1,A,...,An} is symmetrically realizable, and t > 0, is the list 
Ag = {Ar +t,A2 £t,A3,...,An} symmetrically realizable?. 


It was shown in [12] and also in [46] that Question 1 has an affirmative answer. 


Theorem 15. [12] Let A = {A1,a + bi,a — bi,Ag,...,An} be a realizable list of complex numbers. 
Then for all t > 0, the perturbed list 


At = {A, +2t,a—t4 bi,a—t—Di,Ag,...,An} 


is also realizable. 


Question 2, however, remains open. An affirmative answer to Question 2, in the case that 
the symmetric realizing matrix is a nonnegative circulant matrix or it is a nonnegative left 
circulant matrix, it was given in [47]. The use of circulant matrices has been shown to be very 
useful for the NIEP [9, 24]. In [24] it was given a necessary and sufficient condition for a list 
of 5 real numbers, which corresponds to a even-conjugate vector, to be the spectrum of 5 x 5 
symmetric nonnegative circulant matrix: 


Lemma 5. [24] Let A = (A1,A2,A3,A3, Ag)? be a vector of real numbers (even-conjugate) such that 


j], 1=2,3 
Ay 2AQzAsz3 (13) 
Ay +2A2+2A3 > 0 


A necessary and sufficient condition for {A1,A2,A3,A3,A2} to be the spectrum of a symmetric 
nonnegative circulant matrix is 


An > 0. (14) 


Example 9. From Lemma 5 we may know, for instance, that the list 
{6,1,1, —4, —4} is the spectrum of a symmetric nonnegative circulant matrix. 


7. Some open questions 
We finish this chapter by setting two open questions: 


Question 1: If the list of real numbers A = {Ay,A2,...,An} is symmetrically realizable, and t > 0, 
is the list Ay = {A, +t, Az £t,Az3,...,An} also symmetrically realizable? 


Some progress has been done about this question. In [47], it was given an affirmative answer 
to Question 1, in the case that the realizing matrix is symmetric nonnegative circulant matrix 
or it is nonnegative left circulant matrix. In [48] it was shown that if 1 > Ay > --- > 
An = 0, then Theorem 14 holds for positive stochastic, positive doubly stochastic and positive 
symmetric matrices. 


Question 2: How adding one or more zeros to a list can lead to its symmetric realizability by different 
symmetric patterned matrices? 
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The famous Boyle-Handelman Theorem [49] gives a nonconstructive proof of the fact that if 
sy = AK + as pero tf AK > 0, fork = 1,2,..., then there exists a nonnegative number N 
for which the list {A1,...,An,0,...,0}, with N zeros added, is realizable. In [11] Laffey and 
Smigoc completely solve the NIEP for lists of complex numbers A = {Aj,...,An}, closed 
under conjugation, with A2,...,An having real parts smaller than or equal to zero. They show 
the existence of N > 0 for which A with N zeros added is realizable and show how to compute 
the least such N. The situation for symmetrically realizable spectra is different and even less 
is known. 


8. Conclusion 


The nonnegative inverse eigenvalue problem is an open and difficult problem. A full solution 
is unlikely in the near future. A number of partial results are known in the literature about 
the problem, most of them in terms of sufficient conditions. Some matrix results, like Brauer 
Theorem (Theorem 1), Rado Theorem (Theorem 2), and its symmetric version (Theorem 6) 
have been shown to be very useful to derive good sufficient conditions. This way, however, 
seems to be quite narrow and may be other techniques should be explored and applied. 
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1. Introduction 


The realization of a discrete-time, linear, time-invariant (LTT) filter from its impulse response 
provides insight into the role of linear algebra in the analysis of both dynamical systems and 
rational functions. For an LT] filter, a sequence of output data measured over some finite 
period of time may be expressed as the linear combination of the past input and the input 
measured over that same period. For a finite-dimensional LT] filter, the mapping from past 
input to future output is a finite-rank linear operator, and the effect of past input, that is, the 
memory of the system, may be represented as a finite-dimensional vector. This vector is the 
state of the system. 


The central idea of realization theory is to first identify the mapping from past input to future 
output and to then factor it into two parts: a map from the input to the state and another from 
the state to the output. This factorization guarantees that the resulting system representation 
is both casual and finite-dimensional; thus it can be physically constructed, or realized. 


System identification is the science of constructing dynamic models from experimentally 
measured data. Realization-based identification methods construct models by estimating the 
mapping from past input to future output based on this measured data. The non-deterministic 
nature of the estimation process causes this mapping to have an arbitrarily large rank, and so 
a rank-reduction step is required to factor the mapping into a suitable state-space model. Both 
these steps must be carefully considered to guarantee unbiased estimates of dynamic systems. 


The foundations of realization theory are primarily due to Kalman and first appear in the 
landmark paper of [1], though the problem is not defined explicitly until [2], which also coins 
the term “realization” as being the a state-space model of a linear system constructed from an 
experimentally measured impulse response. It was [3] that introduced the structured-matrix 
approach now synonymous with the term “realization theory” by re-interpreting a theorem 
originally due to [4] in a state-space LTI system framework. 


Although Kalman’s original definition of “realization” implied an identification problem, it 
was not until [5] proposed rank-reduction by means of the singular-value decomposition 
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that Ho’s method became feasible for use with non-deterministic data sets. The combination 
of Ho’s method and the singular-value decomposition was finally generalized to use with 
experimentally measured data by Kung in [6]. 


With the arrival of Kung’s method came the birth of what is now known as the field of 
subspace identification methods. These methods use structured matrices of arbitrary input 
and output data to estimate a state-sequence from the system. The system is then identified 
from the propagation of the state over time. While many subspace methods exist, the most 
popular are the Multivariable Output-Error State Space (MOESP) family, due to [7], and the 
Numerical Algorithms for Subspace State-Space System Identification (N4SID) family, due to 
[8]. Related to subspace methods is the Eigensystem Realization Algorithm [9], which applies 
Kung’s algorithm to impulse-response estimates, which are typically estimated through an 
Observer / Kalman Filter Identification (OKID) algorithm [10]. 


This chapter presents the central theory behind realization-based system identification in a 
chronological context, beginning with Kronecker’s theorem, proceeding through the work of 
Kalman and Kung, and presenting a generalization of the procedure to arbitrary sets of data. 
This journey provides an interesting perspective on the original role of linear algebra in the 
analysis of rational functions and highlights the similarities of the different representations 
of LTT filters. Realization theory is a diverse field that connects many tools of linear algebra, 
including structured matrices, the QR-decomposition, the singular-value decomposition, and 
linear least-squares problems. 


2. Transfer-function representations 


We begin by reviewing some properties of discrete-time linear filters, focusing on the role of 
infinite series expansions in analyzing the properties of rational functions. The reconstruction 
of a transfer function from an infinite impulse response is equivalent to the reconstruction of a 
rational function from its Laurent series expansion. The reconstruction problem is introduced 
and solved by forming structured matrices of impulse-response coefficients. 


2.1. Difference equations and transfer functions 


Discrete-time linear filters are most frequently encountered in the form of difference equations 
that relate an input signal u, to an output signal y,. A simple example is an output y; 
determined by a weighted sum of the inputs from ux to uUg_ jy, 


Yk = Vine + By —yUp_y + +++ + D0UR—m- (1) 


More commonly, the output y; also contains a weighted sum of previous outputs, such as a 
weighted sum of samples from yx_1 to Y¢_n, 


Yk = Dintty + Dm — {U1 +++ + DoUK—m — An—1Yk—1 — An—2Yk—2 — ++ + OY K—n- (2) 


The impulse response of a filter is the output sequence g, = yx generated from an input 


1 k=0, 
m= k £0. me 
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The parameters 9; are the impulse-response coefficients, and they completely describe the 
behavior of an LT] filter through the convolution 


Ye = Yo 8Mn-}- (4) 
j=0 


Filters of type (1) are called finite-impulse response (FIR) filters because g; is a finite-length 
sequence that settles to 0 once k > m. Filters of type (2) are called infinite impulse response 
(IIR) filters since generally the impulse response will never completely settle to 0. 


A system is stable if a bounded u, results in a bounded y;. Because the output of LIT filters 
is a linear combination of the input and previous output, any input-output sequence can be 
formed from a linear superposition of other input-output sequences. Hence proving that the 
system has a bounded output for a single input sequence is necessary and sufficient to prove 
the stability of an LIT filter. The simplest input to consider is an impulse, and so a suitable 
definition of system stability is that the absolute sum of the impulse response is bounded, 


lee} 


Y lgkl < ©. (5) 


k=0 


Though the impulse response completely describes the behavior of an LTI filter, it does so 
with an infinite number of parameters. For this reason, discrete-time LIT filters are often 
written as transfer functions of a complex variable z. This enables analysis of filter stability 
and computation of the filter’s frequency response in a finite number of calculations, and it 
simplifies convolution operations into basic polynomial algebra. 


The transfer function is found by grouping output and input terms together and taking the 
Z-transform of both signals. Let Y(z) = Le. ypz * be the Z-transform of y,; and U(z) be 
the Z-transform of u,. From the property 


Z [yal =Y(2)z7 
the relationship between Y(z) and U(z) may be expressed in polynomials of z as 
a(z)Y¥(z) = b(z)U(z). 
The ratio of these two polynomials is the filter’s transfer function 


B(z) Bz + By_yz™ 1 +--+ +b1z + bo 
a(z) Zl tay yzt-14.--+ajztay - 


(6) 


When n > m, G(z) is proper. If the transfer function is not proper, then the difference 
equations will have y; dependent on future input samples such as u;,,1. Proper transfer 
functions are required for causality, and thus all physical systems have proper transfer 
function representations. When n > m, the system is strictly proper. Filters with strictly proper 
transfer functions have no feed-through terms; the output y; does not depend on u;, only the 
preceding input uz_1, Uz_z, .... In this chapter, we assume all systems are causal and all 
transfer functions proper. 


If a(z) and b(z) have no common roots, then the rational function G(z) is coprime, and 
the order n of G(z) cannot be reduced. Fractional representations are not limited to 
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single-input-single-output systems. For vector-valued input signals u, € IR’ and output 
signals yz € IR’, an LTI filter may be represented as an ny X Ny matrix of rational functions 
Gj j(Z), and the system will have matrix-valued impulse-response coefficients. For simplicity, 
we will assume that transfer function representations are single-input-single-output, though 
all results presented here generalize to the multi-input-multi-output case. 


2.2. Stability of transfer function representations 


Because the effect of b(z) is equivalent to a finite-impulse response filter, the only requirement 
for b(z) to produce a stable system is that its coefficients be bounded, which we may safely 
assume is always the case. Thus the stability of a transfer function G(z) is determined entirely 
by a(z), or more precisely, the roots of a(z). To see this, suppose a(z) is factored into its roots, 
which are the poles p; of G(z), 
b(z 
G(z) = Fn ( 


i=1 (z _ Pi) 
To guarantee a bounded ¥;, it is sufficient to study a single pole, which we will denote simply 
as p. Thus we wish to determine necessary and sufficient conditions for stability of the system 


(7) 


1 
tle\ — 
G'(z) = a (8) 
Note that p may be complex. Assume that |z| > |p|. G’(z) then has the Laurent-series 
expansion 
' =i 1 1 F pky—k — J pk-1 pk 
G'(z) =z = ge”) pet =) poe: (9) 
1— pz k=0 k=1 
From the time-shift property of the z-transform, it is immediately clear that the sequence 
0 k=1 

i ‘ 10 
8k ie k>1 (10) 


is the impulse response of G’(z). If we require that (9) is absolutely summable and let |z| = 1, 
the result is the original stability requirement (5), which may be written in terms of p as 


ie) 
* || < 00. 
k=1 


This is true if and only if |p| < 1, and thus G’(z) is stable if and only if |p| < 1. Finally, from (7) 
we may deduce that a system is stable if and only if all the poles of G(z) satisfy the property 
pil <1. 


2.3. Construction of transfer functions from impulse responses 


Transfer functions are a convenient way of representing complex system dynamics in a finite 
number of parameters, but the coefficients of a(z) and b(z) cannot be measured directly. The 
impulse response of a system can be found experimentally by either direct measurement 
or from other means such as taking the inverse Fourier transform of a measured frequency 
response [11]. It cannot, however, be represented in a finite number of parameters. Thus the 
conversion between transfer functions and impulse responses is an extremely useful tool. 
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For a single-pole system such as (8), the expansion (9) provides an obvious means of 
reconstructing a transfer function from a measured impulse response: given any 2 sequential 
impulse-response coefficients g, and 2411, the pole of G’(z) may be found from 


P= 8, Sk+1- (11) 


Notice that this is true for any k, and the impulse response can be said to have a shift-invariant 
property in this respect. 


Less clear is the case when an impulse response is generated by a system with higher-order 
a(z) and b(z). In fact, there is no guarantee that an arbitrary impulse response is the result 
of a linear system of difference equations at all. For an LTI filter, however, the coefficients of 
the impulse response exhibit a linear dependence which may be used to not only verify the 
linearity of the system, but to construct a transfer function representation as well. The exact 
nature of this linear dependence may be found by forming a structured matrix of impulse 
response coefficients and examining its behavior when the indices of the coefficients are 
shifted forward by a single increment, similar to the single-pole case in (11). The result is 
stated in the following theorem, originally due to Kronecker [4] and adopted from the English 
translation of [12]. 


Theorem 1 (Kronecker’s Theorem). Suppose G(z) : C — C is an infinite series of descending 
powers of z, starting with z—1, 


G(z) = gz ly Q2z a4 832 B40. = Y gez*. (12) 
k=1 


Assume G(z) is analytic (the series converges) for all |z| > 1. Let H be an infinitely large matrix of the 
form 
&1 82 83 °°" 
&2 §3 84° °° 
H= | 3 ga 35 *>* (13) 


Then H has finite rank n if and only if G(z) is a strictly proper, coprime, rational function of degree n 


with poles inside the unit circle. That is, G(z) has an alternative representation 


bz + Dy zt +e + yz + bg 
a(z) ZY + ay,_yz!-14---+a,z+49 


(14) 


in which m <n, all roots of a(z) satisfy |z| < 1, a(z) and b(z) have no common roots, and we have 
assumed without loss of generality that a(z) is monic. 


To prove Theorem 1, we first prove that for k > n, g, must be linearly dependent on the 
previous n terms of the series for H to have finite rank. 


Theorem 2. The infinitely large matrix H is of finite rank n if and only if there exists a finite sequence 
W1, &2,+*+ ,Xy such that fork > n, 


n 
Skt = Dy %j8kjr (15) 
j=l 
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and n is the smallest number with this property. 


Proof. Let hy, be the row of H beginning with 9;. If H has rank n, then the first n + 1 rows of 
H are linearly dependent. This implies that for some 1 < p < n, hy +1 is a linear combination 
of hy,..., hp, and thus there exists some sequence a; such that 


P 
Ayy4 = 2 WjNy—j41- (16) 
j=l 


The structure and infinite size of H imply that such a relationship must hold for all following 
rows of H, so that for q > 0 


p 
Ngrpsi = Ly ajhqrp—j4- 
j=l 


Hence any row hx, k > p, can be expressed as a linear combination of the previous p rows. 
Since H has at least n linearly independent rows, p = n, and since this applies element-wise, 
rank(H) = n implies (15). 


Alternatively, (15) implies a relationship of the form (16) exists, and hence rank(H) = p. Since 
n is the smallest possible p, this implies rank(H) = n. 


We now prove Theorem 1. 


Proof. Suppose G(z) is a coprime rational function of the form (14) with series expansion 
(12), which we know exists, since G(z) is analytic for |z| < 1. Without loss of generality, let 
m =n —1,since we may always let b; = 0 for some k. Hence 


b,_1z"—! + by_oz"? +---+b1z+ bo _ 
Z 4a, yz? -1+.---+a,z+49 


giz) +goz 7 + 932 3 +--- 
Multiplying both sides by the denominator of the left, 


by_1z + by—oz"* 


ferret biz + bo 
= g2""1 + (go + 814n—1)2" > + (83 + 24n-1 + B14n-2)2" FP +---, 

and equating powers of z, we find 

by-1 = 81 

bn—2 = 82+ 814n-1 

bn—3 = 83 + S24n-1 + 814n—2 

(17) 

by = 8n—1 + Sn—24n-1 +++ + 8142 
bo = 8nt+ Sn—-14n-1 +++ + 2141 
O = Se41 + 8Kan-1 +++ + 8k—-n4190 kk S01. 
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From this, we have, for k > n, 
n 


S8kt1 = x —AjSk—-j41, 
j=l 
which not only shows that (15) holds, but also shows that a; = —a;. Hence by Theorem 2, H 
must have finite rank. 


Conversely, suppose H has finite rank. Then (15) holds, and we may construct a(z) from a, 
and b(z) from (17) to create a rational function. This function must be coprime since its order 
n is the smallest possible. 


The construction in Theorem 1 is simple to extend to the case in which G(z) is only proper and 
not strictly proper; the additional coefficient by is simply the feed-through term in the impulse 
response, that is, go. 


A result of Theorem 2 is that given finite-dimensional, full-rank matrices 


8k Sk4+1 °°* Sk+n-1 
Sk+1 Sk+2°°* Sk+n 
H= : ; ; (18) 
Skt+n—1 8k+n *** &k+2n—2 
and 
S8k+1 8k+2 °** S8ktn 
Sk+2 Sk+3 °° * Sk+n+1 
a= : ; ; (19) 
Sk+n Sk+n+1 °° * Sk4+2n-1 
the coefficients of a(z) may be calculated as 
00---0 -—ag 
10---0 -a 
O01---0 -—a _ Hy Bea (20) 
00--- 1 —ay_1 


Notice that (11) is in fact a special case of (20). Thus we need only know the first 2n + 1 
impulse-response coefficients to reconstruct the transfer function G(z): 2n to form the matrices 
Hy, and Hy, from (18) and (19), respectively, and possibly the initial coefficient go in case of 
an n'h-order b(z). 


Matrices with the structure of H are useful enough to have a special name. A Hankel matrix 
H is a matrix constructed from a sequence {h;} so that each element H, jk) = Mj+k- For the 
Hankel matrix in (13), hy; = g,%_1. Hy also has an interesting property implied by (20): its row 
space is invariant under shifting of the index k. Because its symmetric, this is also true for its 
column space. Thus this matrix is also often referred to as being shift-invariant. 


While (20) provides a potential method of identifying a system from a measured impulse 
response, this is not a reliable method to use with measured impulse response coefficients that 
are corrupted by noise. The exact linear dependence of the coefficients will not be identical 
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for all k, and the structure of (20) will not be preserved. Inverting Hy will also invert any 
noise on g;, potentially amplifying high-frequency noise content. Finally, the system order n 
is required to be known beforehand, which is usually not the case if only an impulse response 
is available. Fortunately, these difficulties may all be overcome by reinterpreting the results 
Kronecker’s theorem in a state-space framework. First, however, we more carefully examine 
the role of the Hankel matrix in the behavior of LTI filters. 


2.4. Hankel and Toeplitz operators 


The Hankel matrix of impulse response coefficients (13) is more than a tool for computing 
the transfer function representation of a system from its impulse response. It also defines the 
mapping of past input signals to future output signals. To define exactly what this means, we 
write the convolution of (4) around sample k = 0 in matrix form as 


- 0 
y-3 - 80 > | | u3 
Y-2 i281 2h eo 
Vat) |" e810 Heh) 
Yo *** 83.89 81/80 uo 
Y1 "** 84 83 82/81 80 M4 
Y2 "++ 85 84 83/82 81 80 42 


where the vectors and matrix have been partitioned into sections for k < 0 and k > 0. The 
output for k > 0 may then be split into two parts: 


Yo 81 82 83 °°" | | u-1 & <:: O ug 
Y1 82 83 84° °° u_2 81 80 : uy 
= + P (21) 


Y2 8&3 §4 85 °°" u_3 &2 81 80 ug 


where the subscripts p and f denote “past” and “future,” respectively. The system Hankel 
matrix H has returned to describe the effects of the past input u pon the future output y fe Also 
present is the matrix T, which represents the convolution of future input u f with the impulse 
response. Matrices such as T with constant diagonals are called Toeplitz matrices. 


From (21), it can be seen that H defines the effects of past input on future output. One 
interpretation of this is that H represents the “memory” of the system. Because H is a linear 
mapping from uy to y fr the induced matrix 2-norm of H, ||H]||,, can be considered a function 
norm, and in a sense, ||H||, is a measure of the “gain” of the system. ||H||, is often called the 
Hankel-norm of a system, and it plays an important role in model reduction and in the analysis 
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of anti-causal systems. More information on this aspect of linear systems can be found in the 
literature of robust control, for instance, [13]. 


3. State-space representations 


Although transfer functions define system behavior completely with a finite number 
of parameters and simplify frequency-response calculations, they are cumbersome to 
manipulate when the input or output is multi-dimensional or when initial conditions must 
be considered. The other common representation of LTI filters is the state-space form 


Xep1 = Ax, + Bug (22) 
Yk = Cx~ + Duy, 


in which x; € IR" is the system state. The matrices A € R”"*", B € R'*™, C € R'*", and 
D € R'*"™ completely parameterize the system. Only D uniquely defines the input-output 
behavior; any nonsingular matrix T’ may be used to change the state basis via the relationships 


x =x Al =T'AT'! B=TB Ce=ctT!, 


The Z-transform may also be applied to the state-space equations (22) to find 


2[xp+1] = AZ [xi] + BZ [ug > X(z)z = AX(z) + BU(z) 
Z [yx] = CZ [x4] + DZ [uy] => Y(z) = CX(z) + DU(z) 
Te) = Zz Z)= zi — “1 
ue) oe G(z) =C(zIl— A)" B+D, (23) 


and thus, if (22) is the state-space representation of the single-variable system (6), then a(z) is 
the characteristic polynomial of A, det(zI — A). 


Besides clarifying the effect of initial conditions on the output, state-space representations are 
inherently causal, and (23) will always result in a proper system (strictly proper if D = 0). 
For this reason, state-space representations are often called realizable descriptions; while the 
forward-time-shift of z is an inherently non-causal operation, state-space systems may always 
be constructed in reality. 


3.1. Stability, controllability, and observability of state-space representations 


The system impulse response is simple to formulate in terms of the state-space parameters by 
calculation of the output to a unit impulse with xp = 0: 


D k=0 
- a 24 
8k (ae k>0 ey 


Notice the similarity of (10) and (24). In fact, from the eigenvalue decomposition of A, 
A=VAV}, 


we find 


y [gel = : Jca* 1B] = a ICV] (JAE) lv}. 
i k=1 k=1 
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The term |A*~1| will only converge if the largest eigenvalue of A is within the unit circle, and 
thus the condition that all eigenvalues A; of A satisfy |A;| < 1 is a necessary and sufficient 
condition for stability. 


For state-space representations, there is the possibility that a combination of A and B will 
result in a system for which x, cannot be entirely controlled by the input u;. Expressing x; in 
a matrix-form similar to (21) as 


xy =C ba C = [B AB A2B -«.] (25) 
3 ¥ 


demonstrates that x; is in subspace R” if and only if C has rank n. C is the controllability matrix 
and the system is controllable if it has full row rank. 


Similarly, the state x, may not uniquely determine the output for some combinations of A and 
C. Expressing the evolution of the output as a function of the state in matrix-form as 


Yk Cc 
Ye+1 CA 
Yeyo| =O%r, O= | CA? 


demonstrates that there is no nontrivial null space in the mapping from x, to y; if and only 
if O has rank n. O is the observability matrix and the system is observable if it has full column 
rank. 


Systems that are both controllable and observable are called minimal, and for minimal systems, 
the dimension n of the state variable cannot be reduced. In the next section we show that 
minimal state-space system representations convert to coprime transfer functions that are 
found through (23). 


3.2. Construction of state-space representations from impulse responses 


The fact that the denominator of G(z) is the characteristic polynomial of A not only allows 
for the calculation of a transfer function from a state-space representation, but provides an 
alternative version of Kronecker’s theorem for state-space systems, known as the Ho-Kalman 
Algorithm [3]. From the Caley-Hamilton theorem, if a(z) is the characteristic polynomial of 
A, then a(A) = 0, and 


CA‘a(A)B = CA‘ (a" Ago AP ace ge AA a9) B 
k-+ — k+j 
n 
= CAS™B+ ) ajCA*B, 
j=0 
which implies 


n—1 : 
CAMB = — » ajCA*t/B. (26) 
/= 
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Indeed, substitution of (24) into (26) and rearrangement of the indices leads to (15). 
Additionally, substitution of (24) into the product of O and C shows that 


CB CAB CA*B--- 91 8 30+ 
CAB CA?B CA°B --- 82 83 84°°° 


OC = |CA2B CA3B CA4B---| = 83 84 BR =H, 


which confirms our previous statement that H effectively represents the memory of the 
system. Because 
rank(H) = min{rank(Q), rank(C)}, 


we see that rank(H) = n implies the state-space system (22) is minimal. 
If the entries of H are shifted forward by one index to form 
&2 §3 84° °° 


_ 83 84 85 °°" 
H = | 94 35 86 °°*| + 


then once again substituting (24) reveals 
H=OAC. (27) 


Thus the row space and column space of H are invariant under a forward-shift of the indices, 
implying the same shift-invariant structure seen in (20). 


The appearance of A in (27) hints at a method for constructing a state-space realization from 
an impulse response. Suppose the impulse response is known exactly, and let H; be a finite 
slice of H with r block rows and L columns, 


81 82 83 *** 8L 
82 83 84 °** SL+1 
H,= | 83 84 8 °°" 8L+2 


8r—-1 8r Sr4+1 °° 8r4+L-1 
Then any appropriately dimensioned factorization 


Cc 
CA 


2 
Hp =O,C,=|°4 | [BAB A2B--- AL-1B] (28) 
cart 
may be used to find A for some arbitrary state basis as 


A= (O,)' H, (C,)* (29) 
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where H, is H, with the indices shifted forward once and (-)* is the Moore-Penrose 
pseudoinverse. C taken from the first block row of O;, B taken from the first block column of 
Cy, and D taken from go then provides a complete and minimal state-space realization from 
an impulse response. Because H; has rank n and det(zI — A) has degree n, we know from 
Kronecker’s theorem that G(z) taken from (23) will be coprime. 


However, as mentioned before, the impulse response of the system is rarely known exactly. In 
this case only an estimate H, with a non-deterministic error term is available: 


A, =H, +E. 


Because E is non-deterministic, H will always have full rank, regardless of the number of 
rows r. Thus n cannot be determined from examining the rank of H, and even if n is known 
beforehand, a factorization (28) for r > n will not exist. Thus we must find a way of reducing 
the rank of H, in order to find a state-space realization. 


3.3. Rank-reduction of the Hankel matrix estimate 


If H, has full rank, or if n is unknown, its rank must be reduced prior to factorization. 
The obvious tool for reducing the rank of matrices is the singular-value decomposition (SVD). 
Assume for now that n is known. The SVD of H, is 


A, =uxv! 


where U and V! are orthogonal matrices and &% is a diagonal matrix containing the 
nonnegative singular values 7; ordered from largest to smallest. The SVD for a matrix is unique 
and guaranteed to exist, and the number of nonzero singular values of a matrix is equal to its 
rank [14]. 


Because U and V! are orthogonal, the SVD satisfies 
a ju=v"||, = Ello = 01 (30) 


where ||-||, is the induced matrix 2-norm, and 


i 1/2 
a jjuzv7|| = Z\|p = (=) (31) 
t 


where ||-||- is the Frobenius norm. Equation (30) also shows that the Hankel norm of a system 
is the maximum singular value of H,. From (30) and (31), we can directly see that if the SVD 
of H, is partitioned into 


. x vi 
A, = [Un Us] fe | yer 
8 


where U,, is the first n columns of U, X, is the upper-left n x n block of &, and vr is the first 
n rows of V", the solution to the rank-reduction problem is [14] 


Q= arg min ||Q— Ar||, = arg min ||Q— Fi ||p = eave 
rank(Q)=n rank(Q)=n 
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Additionally, the error resulting from the rank reduction is 
e=||Q-A, 


which suggests that if the rank of H; is not known beforehand, it can be determined 
by examining the nonzero singular values in the deterministic case or by searching for a 
significant drop-off in singular values if only a noise-corrupted estimate is available. 


li =On41, 


3.4. Identifying the state-space realization 
From a rank-reduced H,, any factorization 
A, = O/C, 


can be used to estimate O; and Cy. The error in the state-space realization, however, will 
depend on the chosen state basis. Generally we would like to have a state variable with a 
norm ||2,||, in between ||1,||, and ||y,||. As first proposed in [5], choosing the factorization 


Ost and ~G,Sa)/*V, (32) 
results in 
[Orlls = HCxIlo = Vl|Fello. (33) 


and thus, from a functional perspective, the mappings from input to state and state to 
output will have equal magnitudes, and each entry of the state vector x, will have similar 
magnitudes. State-space realizations that satisfy (33) are sometimes called internally balanced 
realizations [11]. (Alternative definitions of a “balanced” realization exist, however, and it is 
generally wise to verify the definition in each context.) 


Choosing the factorization (32) also simplifies computation of the estimate A, since 
a math pe At 
A= (Or) Hr (Cx) 
= x /2uT Vayda |. 


By estimating B as the first block column of C,, C as the first block row of O;, and D as 80, a 
complete state-space realization (A, B.C; D) is identified from this method. 


3.5. Pitfalls of direct realization from an impulse response 


Even though the rank-reduction process allows for realization from a noise-corrupted estimate 
of an impulse response, identification methods that generate a system estimate from a Hankel 
matrix constructed from an estimated impulse response have numerous difficulties when 
applied to noisy measurements. Measuring an impulse response directly is often infeasible; 
high-frequency damping may result in a measurement that has a very brief response before 
the signal-to-noise ratio becomes prohibitively small, and a unit pulse will often excite 
high-frequency nonlinearities that degrade the quality of the resulting estimate. 


Taking the inverse Fourier transform of the frequency response guarantees that the estimates 
of the Markov parameters will converge as the dataset grows only so long as the input is 
broadband. Generally input signals decay in magnitude at higher frequencies, and calculation 
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of the frequency response by inversion of the input will amplify high-frequency noise. We 
would prefer an identification method that is guaranteed to provide a system estimate that 
converges to the true system as the amount of data measured increases and that avoids 
inverting the input. Fortunately, the relationship between input and output data in (21) may 
be used to formulate just such an identification procedure. 


4. Realization from input-output data 


To avoid the difficulties in constructing a system realization from an estimated impulse 
response, we will form a realization-based identification procedure applicable to measured 
input-output data. To sufficiently account for non-deterministic effects in measured data, we 
add a noise term v; € IR”Y to the output to form the noise-perturbed state-space equations 


Xp. = Axp + Bug (34) 
YR = Cxg + Dug + Ug. 


We assume that the noise signal v; is generated by a stationary stochastic process, which may 
be either white or colored. This includes the case in which the state is disturbed by process 
noise, so that the noise process may have the same poles as the deterministic system. (See [15] 
for a thorough discussion of representations of noise in the identification context.) 


4.1. Data-matrix equations 


The goal is to construct a state-space realization using the relationships in (21), but doing 
so requires a complete characterization of the row space of H;. To this end, we expand a 
finite-slice of the future output vector to form a block-Hankel matrix of output data with r 
block rows, 


Yo ¥1 Y2 -**) YL 
Yo¥2 ¥3 °°: YH 


y=] ¥2 ¥3 Ya -** YL4+2 


Yr—1 Yr Yrs +++ Yr+L-1 
This matrix is related to a block-Hankel matrix of future input data 


Up Uy un ::: uy 
uy ung Uy -:: Ups 
ug uz Ug «+ Ups 
Ur = 2 3 4 L4+2 ‘ 
Uy_y Ur Uppy *** Upp E-1 


a block-Toeplitz matrix of past input data 


uU_y Up Uy +++ Up-1 
u_2 U_1 Ug ++: UL-2 
Up = | u_3 u_g U_y «+ Up_3 |» 
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a finite-dimensional block-Toeplitz matrix 


80 HD 
81 80 


T=] 8 81 80 ; 


8r—-1 8r—2 8r-3 *** 80 


the system Hankel matrix H, and a block-Hankel matrix V formed from noise data v;, with 
the same indices as Y by the equation 


Y = HU, + Tuy + V. (35) 
If the entries of Y¢ are shifted forward by one index to form 
YW. Yo ¥3 °°* YEH 


Y2 Y3 Ya ++ YL42 
Y= |¥3 Ya Ys °°: YL43 


Yr Vr4a Yrt2 °° ° Yr+Lh 


then Y; is related to the shifted system Hankel matrix H, the past input data Uy, T with a 
block column appended to the left, and Uf with a block row appended to the bottom, 


81 
82 

T= 83 T , Uf = Ur ; 
: Ur Upyy Up42 + * Ute 
8r 


and a block-Hankel matrix V of noise data v, with the same indices as Y by the equation 


Y = Hu, +TU,; +V. (36) 


From (25), the state is equal to the column vectors of Uy, multiplied by the entries of the 
controllability matrix C, which we may represent as the block-matrix 


X= [eg Hy toes ye] = CU, 


which is an alternative means of representing the memory of the system at sample 0, 1,.... 
The two data matrix equations (35) and (36) may then be written as 


Y=O,;X+TU;f+V, (37) 
Y =O,AX+TUf+V. (38) 


Equation (37) is basis for the field of system identification methods known as subspace 
methods. Subspace identification methods typically fall into one of two categories. First, 
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because a shifted observability matrix 


satisfies 


im(O) =im(O), 
where im(-) of denotes the row space (often called the “image”), the row-space of O is 
shift-invariant, and A may be identified from estimates O; and O; as 


A= 0t0,. 
Alternatively, because a forward-propagated sequence of states 
X= AX 


satisfies sy 
im(X') = im(X’ ), 


the column-space of X is shift-invariant, and A may be identified from estimates Xand X as 
A= kat. 


In both instances, the system dynamics are estimated by propagating the indices forward by 
one step and examining a propagation of linear dynamics, not unlike (20) from Kronecker’s 
theorem. Details of these methods may be found in [16] and [17]. In the next section 
we present a system identification method that constructs system estimates from the 
shift-invariant structure of Y itself. 


4.2. Identification from shift-invariance of output measurements 


Equations (37) and (38) still contain the effects of the future input in Uy. To remove these 
effects from the output, we must first add some assumptions about Uy First, we assume that 


U, has full row rank. This is true for any U; with a smooth frequency response or if UF is 
generated from some pseudo-random sequence. Next, we assume that the initial conditions 
in X do not somehow cancel out the effects of future input. A sufficient condition for this is to 


require 
k = = 
ran. Uy =n+rny 


to have full row rank. Although these assumptions might appear restrictive at first, since it is 
impossible to verify without knowledge of X, it is generally true with the exception of some 
pathological cases. 


Next, we form the null-space projector matrix 


ow fd eet jo 
T= Iny1 — Uf (u;U;) Up, (39) 
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which has the property 
U,l =0. 


We know the inverse of (UU) exists, since we assume U has full row rank. Projector 
matrices such as (39) have many interesting properties. Their eigenvalues are all 0 or 1, and if 
they are symmetric, they separate the subspace of real vectors — in this case, vectors in REI 
— into a subspace and its orthogonal complement. In fact, it is simple to verify that the null 
space of U; contains the null space of Uy as a subspace, since 


_ Us 
Uli = |? | 1=0. 


Thus multiplication of (37) and (38) on the right by IT results in 
YII = O,XI1+ VI, (40) 
YI = O,AXT1 + VIL. (41) 
It is also unnecessary to compute the projected products YI] and YII directly, since from the 
OR-decomposition 
oT vr) Ry Rig 
[a yT| = [21 az) |" ale 
we have 
Y = RpQt + R3Q3 (42) 
and U = R{,Q1. Substitution into (39) reveals 


II = I1-Q, QI. (43) 


Because the columns of Q; and Q> are orthogonal, multiplication of (42) on the right by (43) 
results in 
YI = Rx Q3. 


A similar result holds for YI. Taking the QR-decomposition of the data can alternatively be 
thought of as using the principle of superposition to construct new sequences of input-output 
data through a Gram-Schmidt-type orthogonalization process. A detailed discussion of this 
interpretation can be found in [18]. 


Thus we have successfully removed the effects of future input on the output while retaining 
the effects of the past, which is the foundation of the realization process. We still must account 
for non-deterministic effects in V and V. To do so, we look for some matrix Z such that 


viz! 0, 
viz! > 0. 
This requires the content of Z to be statistically independent of the process that generates vx. 


The input uz is just such a signal, so long as the filter output is not a function of the input — 
that is, the data was measured in open-loop operation,. If we begin measuring input before 
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k = 0 atsome sample k = —¢ and construct Z as a block-Hankel matrix of past input data, 
u 4 u g yu G gern U +L 
u ¢4+1 u f+2 u G+3°° u f+L+1 
Z=t U—¢+2 U-7+3 U_-f44 7 MC 4L42 | 


uy Ug Uy oocts Upp L—2 


then multiplication of (40) and (41) on the right by Z? results in 


Yiiz? + 0,xnz7, (44) 
yuz" = 0,Arz', (45) 


as L — oo. Note the term t in Z is necessary to keep (44) and (45) bounded. 


Finally we are able to perform our rank-reduction technique directly on measured data 
without needing to first estimate the impulse response. From the SVD 


yz? =uxv!, 
we may estimate the order n by looking for a sudden decrease in singular values. From the 
partitioning 

To Xn 0] [Ve 
YTIZ* = [Un Us] 0 ul yr ; 
we nay estimate O, and XIIZ T from the factorization 
O,=Un=¥? and kz? =al?yT, 

A may then be estimated as 


ds A = re t bass 
A= (6,)"Ynz? (an1Z7) =e, 4?ut YNZ" V,2_/? 


— +t ——— 
= (O,)' (HUpI1) (CrUpT1)" = (O,)' H (Cz)". 
And so we have returned to our original relationship (29). 


While C may be estimated from the top block row of O,, our projection has lost the column 
space of H, that we previously used to estimate B, and initial conditions in X prevent us from 
estimating D directly. Fortunately, if A and C are known, then the remaining terms B, D, and 
an initial condition x9 are linear in the input output data, and may be estimated by solving a 
linear-least-squares problem. 


4.3. Estimation of B, D, and x0 


The input-to-state terms B and D may be estimated by examining the convolution with the 
state-space form of the impulse response. Expanding (24) with the input and including an 
initial condition xp results in 


k-1 
yp = CA‘xg + Y) CARI“ Buj + Dug + of. (46) 
j=0 
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Factoring out B and D on the right provides 
k=-1 
yy = CA*xg + y uj @ CAI-} / vec(B) + (uf ® In) vec(D) + 0%, 
j=0 


in which vec(-) is the operation that stacks the columns of a matrix on one another, ® is the 
(coincidentally named) Kronecker product, and we have made use of the identity 


vec(AXB) = (B! @ A)vec(X). 
Grouping the unknown terms together results in 
i x0 
Ve = [cat ne ul @CAKH1 uf @ In, | vec(B) | +0. 
vec(D) 
Thus by forming the regressor 


de = [CAF DIT) uf @ CARI? uf @ Iny | 


from the estimates A and C, estimates of B and D may be found from the least-squares solution 
of the linear system of N equations 


Yo fo 

"1 1 Ro 

y2| = | $2] | vec(B) 
; vec(D) 

YN Nn 


Note that N is arbitrary and does not need to be related in any way to the indices of the data 
matrix equations. This can be useful, since for large-dimensional systems, the regressor 
may become very computationally expensive to compute. 


5. Conclusion 


Beginning with the construction of a transfer function from an impulse response, we 
have constructed a method for identification of state-space realizations of linear filters 
from measured input-output data, introducing the fundamental concepts of realization 
theory of linear systems along the way. Computing a state-space realization from 
measured input-output data requires many tools of linear algebra: projections and the 
QR-decomposition, rank reduction and the singular-value decomposition, and linear least 
squares. The principles of realization theory provide insight into the different representations 
of linear systems, as well as the role of rational functions and series expansions in linear 
algebra. 
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1. Introduction 


Partition-Matrix Theory and Generalized-Inverses are interesting topics explored in linear 
algebra and matrix computation. Partition-Matrix Theory is associated with the problem of 
properly partitioning a matrix into block matrices (i.e. an array of matrices), and is a matrix 
computation tool widely employed in several scientific-technological application areas. For 
instance, blockwise Toeplitz-based covariance matrices are used to model structural 
properties for space-time multivariate adaptive processing in radar applications [1], 
Jacobian response matrices are partitioned into several block-matrix instances in order to 
enhance medical images for Electrical-Impedance-Tomography [2], design of state- 
regulators and partial-observers for non-controllable/non-observable linear continuous 
systems contemplates matrix blocks for controllable/non-controllable and observable/non- 
observable eigenvalues [3]. The Generalized-Inverse is a common and natural problem 
found in a vast of applications. In control robotics, non-collocated partial linearization is 
applied to underactuated mechanical systems through inertia-decoupling regulators which 
employ a pseudoinverse as part of a modified input control law [4]. At sliding-mode control 
structures, a Right-Pseudoinverse is incorporated into a state-feedback control law in order 
to stabilize electromechanical non-linear systems [5]. Under the topic of system 
identification, definition of a Left-Pseudoinverse is present in auto-regressive moving- 
average models (ARMA) for matching dynamical properties of unknown systems [6]. An 
interesting approach arises whenever Partition-Matrix Theory and Generalized-Inverse are 
combined together yielding attractive solutions for solving the problem of block matrix 
inversion [7-10]. Nevertheless, several assumptions and restrictions regarding numerical 
stability and structural properties are considered for these alternatives. For example, an 
attractive pivot-free block matrix inversion algorithm is proposed in [7], which 
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unfortunately exhibits an overhead in matrix multiplications that are required in order to 
guarantee full-rank properties for particular blocks within it. For circumventing the expense 
in rank deficiency, [8] offers block-matrix completion strategies in order to find the 
Generalized-Inverse of any non-singular block matrix (irrespective of the singularity of their 
constituting sub-blocks). However, the existence of intermediate matrix inverses and 
pseudoinverses throughout this algorithm still rely on full-rank assumptions, as well as 
introducing more hardness to the problem. The proposals exposed in [9-10] avoid 
completion strategies and contemplate all possible scenarios for avoiding any rank 
deficiency among each matrix sub-block, yet demanding full-rank assumptions for each 
scenario. In this chapter, an iterative-recursive algorithm for computing a Left- 
Pseudoinverse (LPI) of a MIMO channel matrix is developed by combining Partition-Matrix 
Theory and Generalized-Inverse concepts. For this approach, no matrix-operations’ 
overhead nor any particular block matrix full-rank assumptions are needed because of 
structural attributes of the MIMO channel matrix, which models dynamical properties of a 
Rayleigh fading channel (RFC) within wireless MIMO communication systems. 


The content of this work is outlined as follows. Section 2 provides a description of the 
MIMO communication link, pointing out its principal physical effects and the mathematical 
model considered for RFC-based environments. Section 3 defines formally the problem of 
computing the Left-Pseudoinverse as the Generalized-Inverse for the MIMO channel matrix 
applying Partition-Matrix Theory concepts. Section 4 presents linear algebra and matrix 
computation concepts and tools needed for tracking a solution for the aforementioned 
problem. Section 5 analyzes important properties of the MIMO channel matrix derived from 
a Rayleigh fading channel scenario. Section 6 explains the proposed novel algorithm. Section 
7 presents a brief analysis of VLSI (Very Large Scale of Integration) aspects towards 
implementation of arithmetic operations presented in this algorithm. Section 8 concludes the 
chapter. Due to the vast literature about MIMO systems, and to the best of the authors’ 
knowledge, this chapter provides a nice and strategic list of references in order to easily 
correlate essential concepts between matrix theory and MIMO systems. For instance, [11-16] 
describe and analyze information and system aspects about MIMO communication systems, 
as well as studying MIMO channel matrix behavior under RFC-based environments; [17-18] 
contain all useful linear algebra and matrix computation theoretical concepts around the 
mathematical background immersed in MIMO systems; [19-21] provide practical guidelines 
and examples for MIMO channel matrix realizations comprising RFC scenarios; [22] treats 
the formulation and development of the algorithm presented in this chapter; [23-27] detail a 
splendid survey on architectural aspects for implementing several arithmetic operations. 


2. MIMO systems 


In the context of wireless communication systems, MIMO (Multiple-Input Multiple- 
Output) is an extension of the classical SISO (Single-Input Single-Output) communication 
paradigm, where instead of having a communication link composed of a single 
transmitter-end and a receiver-end element (or antenna), wireless MIMO communication 
systems (or just MIMO systems) consist of an array of multiple elements at both the 
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transmission and reception parts [11-16,19-21]. Generally speaking, the MIMO 
communication link contains n, transmitter-end and n, receiver-end antennas sending- 
and-receiving information through a wireless channel. Extensive studies on MIMO 
systems and commercial devices already employing them reveal that these 
communication systems offer promising results in terms of: a) spectral efficiency and 
channel capacity enhancements (many user-end applications supporting high-data rates 
at limited available bandwidth); b) improvements on Bit-Error-Rate (BER) performance; 
and c) practical feasability already seen in several wireless communication standards. The 
conceptualization of this paradigm is illustrated in figure 1, where Tx is the transmitter- 
end, Rx the receiver-end, and Chx the channel. 


Chx 


Figure 1. The MIMO system: conceptualization for the MIMO communication paradigm. 


Notice that information sent from the trasnmission part (Tx label on figure 1) will suffer 
from several degradative and distorional effects inherent in the channel (Chx label on 
figure 1), forcing the reception part (Rx label on figure 1) to decode information properly. 
Information at Rx will suffer from degradations caused by time, frequency, and spatial 
characteristics of the MIMO communication link [11-12,14]. These issues are directly 
related to: i) the presence of physical obstacles obstructing the Line-of-Sight (LOS) between 
Tx and Rx (existance of non-LOS); ii) time delays between received and transmitted 
information signals due to Tx and Rx dynamical properties (time-selectivity of Chx); iii) 
frequency distortion and interference among signal carriers through Chx (frequency- 
selectivity of Chx); iv) correlation of information between receiver-end elements. Fading 
(or fading mutlipath) and noise are the most common destructive phenomena that 
significantly affect information at Rx [11-16]. Fading is a combination of time-frequency 
replicas of the trasnmitted information as a consequence of the MIMO system phenomena 
i)-iv) exposed before, whereas noise affects information at every receiver-end element 
under an additve or multiplicative way. As a consequence, degradation of signal 
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information rests mainly upon magnitude attenuation and time-frequency shiftings. The 
simplest treatable MIMO communication link has a slow-flat quasi-static fading channel 
(proper of a non-LOS indoor environment). For this type of scenario, a well-known 
dynamical-stochastic model considers a Rayleigh fading channel (RFC) [13,15-16,19-21], 
which gives a quantitative clue of how information has been degradated by means of Chx. 
Moreover, this type of channels allows to: a) distiguish among each information block 
tranmitted from the n, elements at every Chx realization (i.e. the time during which the 
channel’s properties remain unvariant); and b) implement easily symbol decoding tasks 
related to channel equalization (CE) techniques. Likewise, noise is commonly assumed to 
have additive effects over Rx. Once again, all of these assumptions provide a treatable 
information-decoding problem (refered as MIMO demodulation [12]), and the 
mathematical model that suits the aforementioned MIMO communication link 
characteristics will be represented by 


tahoe, (1) 


where: x eZin" ect i is a complex-valued n,-dimensional transmitted vector with 
entries drawn from a Gaussian-integer finite-lattice constellation (digital modulators, such 
as: g-QAM, QPSK); yeC"**' is a complex-valued n,-dimensional received vector; 
7] eC” is a n,-dimensional independent-identically-distributed (idd) complex- 
circularly-symmetric (ccs) Additive White Gaussian Noise (AWGN) vector; and 
H eC"**"" is the (1, xn,)- dimensional MIMO channel matrix whose entries model: a) the 
RFC-based environment behavior according to a Gaussian probabilistic density function 
with zero-mean and 0.5-variance statistics; and b) the time-invariant transfer function 
(which measures the degradation of the signal information) between the i-th receiver-end 
and the j-th trasnmitter-end antennas [11-16,19-21]. Figure 2 gives a representation of (1). 
As shown therein, the MIMO communication link model stated in (1) can be also 
expressed as 


vy My Man, |) x, ™ 
D fa]: : : l+] (2) 


Yap Ny ie Ni Xn, Mri 


Notice from (1-2) that an important requisite for CE purposes within RFC scenarios is that 
H is provided somehow to the Rx. This MIMO system requirement is classically known as 
Channel State Information (CSI) [11-16]. In the sequel of this work, symbol-decoding efforts 
will consider the problem of finding x from y regarding CSI at the Rx part within a slow- 
flat quasi-static RFC-based environment as modeled in (1-2). In simpler words, Rx must find 
xfrom degradated information y through calculating an inversion over H. Moreover, 
Np =n, is commonly assumed for MIMO demodulation tasks [13-14] because it guarantees 
linear independency between row-entries of matrix H in (2), yielding a nonhomogeneous 
overdetermined system of linear equations. 
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Figure 2. Representation for the MIMO communication link model according to y = Hx + 7 . Here, each 


dotted arrow represents an entry h, in H which determines channel degradation between the j-th 


transmitter and the i-th receiver elements. AWGN appears additively in each receiver-end antenna. 


3. Problem definition 


Recall for the moment the mathematical model provided in (1). Consider ©" and ' to be the 
real and imaginary parts of a complex-valued matrix (vector) ©, that is, ®= 0" + jo’. Then, 
Equation (1) can be expanded as follows: 


y' + jy’ =(H'x" —H'x' +o") j(Hix" +H'x' +1') 


It can be noticed from Equation (3) that: x 2eZ yy yvieR ; 


Npx1 , 


tn eR 


(3) 


1 
"RS ~ and 


H",H' €R"s“' , An alternative representation for the MIMO communication link model in 


(2) can be expressed as 


£ H' 
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k NeR™**, 


CSI is still needed for MIMO demodulation purposes involving (4). Moreover, if N, =2n, 


and N, =2n,, then N, >N,. Obviously, while seeking for a solution of signal vector X from 
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(4), the reception part Rx will provide also the solution for signal vector x, and thus MIMO 
demodulation tasks will be fulfilled. This problem can be defined formally into the 
following manner: 


Definition 1. Given parameters N,=2n, ¢Z*andN,=2n,¢Z*, and a_ block-matrix 


heR**’ , there exists an operator [: (RN x RNP: > RN?! which solves the matrix- 


block equation Y=hX+N so that rYh]=X. rT 


From Definition 1, the following affirmations hold: i) CSI over h is a necessary condition as 
an input argument for the operator I’; and ii) [can be naively defined as a Generalized- 


Inverse of the block-matrix h. In simpler terms, X=h'y ! is associated with r[Yh] and 
-1 
ht eR”; stands for the Generalized-Inverse of the block-matrix h, where h* = (Th) h’ 


[17-18]. Clearly, [e]’ and [-[ represent the inverse and transpose matrix operations over 
real-valued matrices. As a concluding remark, computing the Generalized-Inverse h* can 


-1 
be separated into two operations: 1) a block-matrix inversion (h"h) 2, 2) a typical matrix 


-1 
multiplication (hh) -h’. For these tasks, Partition-Matrix Theory will be employed in 


order to find a novel algorithm for computing a Generalized-Inverse related to (4). 


4, Mathematical background 
4.1. Partition-matrix theory 
Partition-Matrix Theory embraces structures related to block matrices (or partition matrices: 


an array of matrices) [17-18]. Furthermore, a block-matrix L with (1+q)x(m+p) dimension 


can be constructed (or partitioned) consistently according to matrix sub-blocks A, B, C, 
and D of nxm,nxp,qxm,and qxp dimensions, respectively, yielding 


fileB 


An interesting operation to be performed for these structures given in (5) is the inversion, 
(n+m)x(n+™) be a full-rank real-valued 


block matrix (the subsequent treatment is also valid for complex-valued entities, ie. 


ie. a blockwise inversion ['. For instance, let LeR 


"In the context of MIMO systems, this matrix operation is commonly found in Babai estimators for symbol-decoding 
purposes at the Rx part [12,13]. For the reader's interest, refer to [11-16] for other MIMO demodulation techniques. 


-1 
2 Notice that h? « RN’*N: and (7h) e RNON 
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(n+m)x(n+m) 


Lec 


CeR””, and DeR”*”. Assume also A and D to be full-rank matrices. Then, 


). An alternative partition can be performed with AeR"”, BeR™”, 


(A = BD"'c) -(A = BD"'c)" BD? ‘6 
Tie 
-(D = cas) cat (D = cas)" 


This strategy (to be proved in the next part) requires additonally and mandatorily full-rank 


a b 
over matrices A-BD'C and D-CA™B. The simple case is defined for L-| i 
c 


2x2 2x2 


(indistinctly for R“ or C°™). Once again, assuming det(L) #0, a¥0,and d+0 (related to 


full-rank restictions within block-matrix L ): 


cay feet) te 9 
Gegeae: Gkey eee 4 


ihe 


where evidently (ad 7 be) #0, RCM) (« = bac] #0,and (a = cab) #0. 


4.2. Matrix Inversion Lemma 


The Matrix Inversion Lemma is an indirect consequence of inverting non-singular block 
matrices [17-18], either real-valued or complex-valued, e.g., under certain restrictions °%. 
Lemma 1 states this result. 


Lemma 1. Let ¥ eRC™’,2e¢ RC, YeRC™, and Se RC” be real-valued or complex- 
valued matrices. Assume these matrices to be non-singular: Y ,Y, (v +ZYE) , and 
(rv + zy) _ Then, 


“9 -1 
(v+EYe) =H t- eo Oia ra zy" z) ay (7) 
Proof. The validation of (7) must satisfy 


i (Y+EY8). i 5 we + =v)" av] =I.,and 


-1 
[ee -woz(y7 + ayy ay") (¥ + rYe) =I,., where I, represents the rxr identity 


BS -1 
matrix. Notice the existance of matrices w', y', (W+ Y=) "and (v" +e¥75) : 


Manipulating i) shows: 


3 Refer to [3,7-10,17,18] to review lemmata exposed for these issues and related results. 
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(¥ +EYz)- [ws - wet i svt)" a 


=1,-2(Y"+ ayy)" ey + ryew! -syeylz(y t+ av7z)" ay 


=1, +EYE¥7 -rr(¥ +e9-z)(17 +s¥7z) ay 


=1,+2YEY!-ryeyv 7 =1. 


Likewise for ii): 
24 
(w -wz(1" +e¥"5) sv).(¥ +ZYE) 
Ayys -1 A agate -1 oe se ae 
=1,+¥rye-ws(y7+ev's) e-y'x(yt+ey7z) sy sys 


-1 
=1,4¥ sve-v7E(y?+eyz) (rt +e s)re 


=1,+P72YE-¥LYE=1,.1 


Now it is pertinent to demonstrate (6) with the aid of Lemma 1. It must be verified that both 


LL* and L'L must be equal to the (n+m)x(n+m) identity block matrix 


I 
Tey ry -|, " all with consistent-dimensional identity and zero sub-blocks: [,, 1, ; 
Onncm? Omxn respectively. We start by calulating 
-1 -1 
A B A-BD'C -(A-BD"'c) BD" 
EES ie ; ( ) ( ) fe (8) 
-(D-cA"B) CAt — (D-CA™B) 
and 
-1 -1 
A-BD'C -(A-BD'c) BD'|T 4 B 
papel oh ) ( : 2 sl ® 
-(D-cA"B) CAt (D-CA) 


by applying (7) in Lemma 1 to both matrices (A-BD“c)" eRc™ and 


(D-cas)" e RC”, which are present in (8) and (9), and recalling full-rank conditions 


not only over those matrices but also for A and D, yields the relations 


Partition-Matrix Theory Applied to the Computation of 
Generalized-Inverses for MIMO Systems in Rayleigh Fading Channels 145 


-1 


(A = BD"'c) 5 a A'B(D é CAB) cAt (10) 


-1 


(D = cas)” ps D'c(A 2 BD''c) BD* (11) 


Using (10-11) in (8-9), the following results arise: 


a. for operations involved in sub-blocks of LU!: 


-1 


A(A a BD"'c) x B(D 2 CAB) Gan 


=4)a" .A"B(D-ca"s)' ca]-a(D-ca")' ae 
ee B(D . cA"B) CAMs B(D = cas) ' CAC HLs 
-a(A-Bp"'c) "Bp" +8(D-ca“)" 
=A) 4*+478(D-ca"8) ‘ca ]pD" +(D-cas) 
=-pp"~a(D-ca“B) ‘cat + B(D-caB) 
nxm 7 


Bp te B(D = cas) (-ca"B " p)D" =0 


c(A = BD"'c) . D(D : cas)" cae 


s c(A S BD"'c) p| >" D'c(A BD"'c) ap" |ca” 
Z c(A ' BDC)" =m ge c(A Z BD"'c) BD“CA7 


mxn’ 


= c(a = BDC) [4 = BD''c |a" -~ca’ =0 
= 


-c(A = BDC). BD D(D . CAB) 


2 -c(A = BDC)" BD? + | >" + Dic(A = BD"'c) aD" 
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m’ 


=-c(A- BD"'c) BD" +I,,+C(A- BD"'c) BD" =I,,; 


thus, LL =I, 


n+m) * 


b. for operations involved in sub-blocks of L™'L: 
(a-Bp"c)' a-(A-BD"c) BDC 
=(A-Bp"c) [4-sD"c ]= 1,3 


(A = BD"'c) pe (A = BD"'c) BD“D =0 


nxm’ 


mxn’ 


-(D = cas) CA Ae (D 5 ca)" C=0 
-(D = cas) | CA™B+ (D = cas) D 


= -(D = cas)" [-ca"'B + D| =13 


thus, D'L=I 


(n+m) * 


4.3. Generalized-Inverse 


The concept of Generalized-Inverse is an extension of a matrix inversion operations applied 
to non-singular rectangular matrices [17-18]. For notation purposes and without loss of 
generalization, »(G) and G’ denote the rank of a rectangular matrix Ge M""", and G' =G" 
is the transpose-conjugate of G (when M=C >GeC”") or G'=G" is the transpose of G 

(when M-R > Ge R"”” ), respectively. 


Definition 2. Let GeM"*" and 0< p(G)<min(m,n). Then, there exists a matrix Gtem™™" 
(identified as the Generalized-Inverse), such that it satisfies several conditions for the 
following cases: 


case i: if m>n and 0< p(G)<min(m,n)>p(c)=n, then there exists a unique matrix 

Gt=G* eM" (identified as Left-Pseudoinverse: LPI) such that G*G = I,, satisfying: a) 
-1 

GG*G =G, and b) G°GG* =G’ . Therefore, the LPI matrix is proposed as G* = (cc) G'. 


case ii: if m=n and det(G) #0 <= p(G) =n, then there exists a unique matrix G' =G! «M”™” 
(identified as Inverse) such that G"'G =GG' =I, . 


case iii: if m<n and 0<p(G)<min(m,n)>p(c)=m, then there exists a unique 
matrix G'+G «M™" (identified as Right-Pseudoinverse: RPI) such that GG =1I,, 
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satisfying: a) GGG=G, and b) GGG =G.. Therefore, the RPI matrix is proposed as 
G =G'(c" y 7 

Given the mathematical structure for G* provided in Definition 2, it can be easily validated 
that: 1) For a LPI matrix stipulated in case i, GG'G=G and G'GG' =G" with Gt =(G ‘e), G'; 
2) For a RPI matrix stipulated in case iii, GG'G=G and G'cG' =G* with ct =G (6c! y ; iii) 
For the Inverse in case ii, G* = =(4" c) G'=G Neg! |" =G . Fora uniqueness test for all cases, 
assume the existance of matrices G] ¢M™” and G} «M”” such that G'G=1, and Gic =I, 
(for case i), and GG} =I, and GG} =1,, (for case iii). Notice immediately, te -G )e= 0, (for 
case i) and c(G} <i) a9 (for case as which obligates Gi =G} for both cases, because of 
full-rank properties over G. Clearly, case ii is a particular consequence of cases i and iii. 


5. The MIMO channel matrix 


The MIMO channel matrix is the mathematical representation for modeling the degradation 
phenomena presented in the RFC scenario presented in (2). The elements h, in H <C""" 
represent a time-invariant transfer function (possesing spectral afore about 
magnitude and phase profiles) between a j-th transmitter and an i-th receiver antenna. Once 
again, dynamical properties of physical phenomena ‘4 such as path-loss, shadowing, 
multipath, Doppler spreading, coherence time, absorption, reflection, scattering, diffraction, 
basestation-user motion, antenna’s physical properties-dimensions, information correlation, 
associated with a slow-flat quasi-static RFC scenario (proper of a non-LOS indoor wireless 
environments) are highlighted into a statistical model represented by matrix H. For H u 
purposes, CSI is a necessary feature required at the reception part in (2), as well as the 
Np 2n, condition. Table 1 provides severaln, >n, MIMO channel matrix realizations for 
RFC-based environments [19-21]. On table 1: a)MIMO(n,,n,): refers to the MIMO 
communication link configuration, i... amount of receiver-end and_ transmitter-end 
elements; b)H,,: refers to a MIMO channel ees realization; c)H;,: refers to the 
Kopi eae LPI, computed as H! = =(HEH,, ) H i ; d) h: blockwise matrix version for 
H,,;@) h’ : refers to the corresponding LPI, computed as h'= (h7 hn) h’. As an additional 
point of analysis, full-rank properties over H and h (and thus the existance of matrices H*, 
H,h*, and h') are validated and corroborated through a MATLAB simulation-driven 
model regarding frequency-selective and time-invariant properties for several RFC-based 
scenarios at different MIMO configurations. Experimental data were generated upon 10° 
MIMO channel matrix realizations. As illustrated in figure 3, a common pattern is found 
regarding the statistical evolution for full-rank properties of H and h with n,2n, at 
several typical MIMO configurations, for instance, MIMO(2,2), MIMO(4,2), and MIMO(4,4). It 
is plotted therein REAL(H,h) against IMAG(H,h), where each axis label denote respectively 
the real and imaginary parts of: a) det(H) and det(h) when n, =n,, and b) det(H"H and 
det(h"h) when . Blue crosses indicate the behavior of »(H) related to det(H) and det H"H) 


“ We suggest the reader consulting references [11-16] for a detail and clear explanation on these narrowband and 
wideband physical phenomena presented in wireless MIMO communication systems. 
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(det(H) legend on top-left margin), while red crosses indicate the behavior of p(h) related to 
det(h) and det(h'h) (det(h) legend on top-left margin). The black-circled zone intersected 
with black-dotted lines locates the 0 + jo value. As depicted on figures (4)-(5), a closer glance 
at this statistical behavior reveals a prevalence on full-rank properties of Hand h, meaning 
that non of the determinants det(H) , det(h), det(H"H) and det(h"h) is equal to zero (behavior 
enclosed by the light-blue region and delimited by blue/red-dotted lines). 


MIMO channel matrix rank-determinat behavior for matrices H and h 


Figure 3. MIMO channel matrix rank-determinant behavior for several realizations for H and h. This 
statistical evolution is a common pattern found for several MIMO configurations involving slow-flat 
quasi-static RFC-based environments with Np 2 1,. 
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1.12 0.15 
0.23 0.96 


0.922 -0.144 
—0221 1.076 


0.85-j047 -0.06+ j0.34 loan” jO.115 -0.06+ 0.26 


-0.37 - j0.72 
0.06 + j0.45 
~0.91+ f1.03 


1.12 0.15 
0.96 


1.12 0.15 
0.23 0.96 


— j0.28 


1.07 - j0.02 


0.051— 0.082 0.217+ 0.116 


114+ 71.13 0.088-—j015 -0206-j0.258 
0234-j0.275 0.15-—/0.018 


0.922 -0.144 


0.221 1.076 0 
0 0.922 -0.144 
0 0.221 1.076 


0.206 
0.15 
0.258 
0.018 
0.15 0258 
0.275 0.018 
0.088 -0.206 
0.234 0.15 


Table 1. MIMO channel matrix realizations for several MIMO communication link configurations at 


slow-flat quasi-static RFC scenarios. 
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MIMO channel matrix rank-determinat behavior for matrices H and h 
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IMAG (H,h) 
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-0.01 


0.01 -0.008 -0.006 -0.002 0 0.002 0.004 0.006 0008 0.01 


REAL (H.h) 


-0.004 


Figure 4. MIMO channel matrix rank-determinant behavior for several realizations for H . Full-rank 
properties for H and H "HE preveal for RFC-based environments (light-blue region delimited by blue- 
dotted lines). 


x 10° MIMO channel matrix rank-determinat behavior for matrices H and h 


7 


IMAG (H,h) 


peeececepoesescees 


0 2 a 6 8 10 12 14 16 18 
REAL (H.h) 107 
Figure 5. MIMO channel matrix rank-determinant behavior for several realizations for h . Full-rank 
properties for hand h‘h preveal for RFC-based environments (light-blue region delimited by red- 
dotted line). 
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6. Proposed algorithm 


The proposal for a novel algorithm for computing a LPI matrix h* «R7"""*"* (with n, >n,) 


is based on the block-matrix structure of h as exhibited in (4). This idea is an extension of 
the approach presented in [22]. The existence for this Generalized-Inverse matrix is 
supported on the statistical properties of the slow-flat quasi-static RFC scenario which 
impact directly on the singularity of H at every MIMO channel matrix realization. Keeping 
in mind that other approaches attempting to solve the block-matrix inversion problem [7-10] 
requires several constraints and conditions, the subsequent proposal does not require any 
restriction at all mainly due to the aforementioned properties of H . From (4), it is suggested 

gl: n{H"} -3{H"| 
that _ | is somehow related to 


-Y ; hence, calculating h* will lead to this 


‘ i ; ae : A -B 
solution. Let A=H" and B=H'’. It is kwon a priori that p( A+ jB) =n,. Then h= ib ‘ 


; : poe ~ % ~ |M -L ‘ 
with p(h) =2n, =N,. Define the matrix Q as Q =h he RN’: where |" zi with 


T me 
M=A'A+B BeR™™T, L=A™B-(A'™B) eR"*"", and p(Q)=N, as a direct consequence 
t q 


from 2n, > 2n, > N, 2N,.It can be seen that 


ht =ath’ e RN, (12) 


For simplicity, matrix operations involved in (12) require classic multiply-and-accumulate 


N,xN, 


operations between row-entries of ateR ‘and column-entries of h? ¢ R™:*: . Notice 


immediately that the critical and essential task of computing h” relies on finding the block 
matrix inverse G7” 5. The strategy to be followed in order to solve 4 in (12) will consist of 
the following steps: 1) the proposition of partitioning Q without any restriction on rank- 
defficiency over inner matrix sub-blocks; 2) the definition of iterative multiply-and- 
accumulate operations within sub-blocks comprised in ©; 3) the recursive definition for 
compacting the overall blockwise matrix inversion. Keep in mind that matrix Q can be also 


Ou 1N, 


: 7 28 4 : . - |M -L ; ; 
viewed as Q=| : ., : . The symmetry presented in = : will motivate 


On 1 nN, 


the development for the pertinent LPI-based algorithm. From (12) and by the use of Lemma 
Q 


\ 2 ne “4 * x 
1 it can be concluded that a ‘1 , where Q =(M + LM"L) eR’ "rt, P=QXeR™™", 


nxn 
T 


® Notice that (4+jB) =(m +jt)'(a" a). Moreover, (M+jL) = ¢+j£eC""", where 
C= (m + LM"L) = ( + M'M) ML' and é= -m'L(m + LM"L). = -(z + Mi'm) 


151 


152 Linear Algebra — Theorems and Applications 


and X=LM' «R"*"" . Interesting enough, full-rank is identified at each matrix sub-block in 
the main diagonal of © (besides p(Q) =n, ). This structural behavior serves as the leitmotiv 


for the construction of an algorithm for computing the blockwise inverse 47’. Basically 
speaking and concerning step 1) of this strategy, the matrix partition procedure obeys the 
assignments (13-16) defined as: 


o a) 
N, -(2k+1),N,-(2k+1)  ©N,-(2k+1),N, -2k 
W, = Z ( ) t ( ) ao ( ) t e R2*2 (13) 
IN, -2k,N,-(2k+1) ON,-2k,N,-2k | 
IN, -(2k+1),N, -(2k-1 IN, -(2k+1),N 
xX, = ( ) t ( ) i ( ) tle R2*2k (14) 
On,-2k,N,-(2k-1) 177 N, -2k,N, 
On, -(2k-1),N,-(2k+1) Nn, -(2k-1),N,-2k 
: : 2kx2 
Y= ; : eR (15) 
On, .N,-(2k+1) On), N, -2k 
On, -1,N,-1 ®N,-1,N 
Zi = ‘ t t ne tle R22 (16) 
®n,.N,-1 N,N, 


The matrix partition over Q obeys the index k=1:1:(N,/2 -1). Because of the even- 


rectangular dimensions of Q, matirx Q owns exactly an amount of N, j2=t, sub-block 


matrices of 2x2 dimension along its main diagonal. Interesting enough, due to RFC-based 
environment characteristics studied in (1) and (4), it is found that: 


o(W,) = p(Z,) =2 a7) 


After performing these structural characteristics for Q, and with the use of (13-16), step 2) 
of the strategy consists of the following iterative operations also indexed by 
k=1:1:(N,/2 -1), in the sense of performing: 


b, = Wh - XZ (18) 
Sige aat 

a, = $ Xi Ly 4 (19) 

9. = Ze + Ze (20) 


Here: 24 e ROK & € R?*:, a, € R??* and 6G, € Ree Steps stated in (18-20) help to 
construct intermediate sub-blocks as 
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= 
We X, Z a 
—— a nw —— 
~ ae 2x2 2x2k 
Q, = 2x2 ad ae Q;! = x : x (21) 
—— “— -0,Y,W, 
akepx2key | oe Chet 2(k+1)x2(k+1) OY Wy 4, 
2kx2  2kx2k Dkx2 Qkx2k 


The dimensions of each real-valued sub-block in (21) are indicated consistently °. For step 3) of 
the strategy, a recursion step ZZ) is provided in terms of the assignment 
Ze =O; e RAED AD) | Clearly, only inversions of W,, Z,, and ¢ (which are 2x2 
matrices, yielding correspondingly W, 7 Zane and ¢ ty are required to be performed 
throughout this iterative-recursive process, unlike the operation linked to Zo) , which comes 
from a previous updating step associated with the recursion belonging to an Although 
p(Q) =N, assures the existance of Q", full-rank requirements outlined in (17) and non-zero 
determinants for (18) are strongly needed for this iterative-recursive algorithm to work 
accordingly. Also, full-rank is expected for every recursive outcome related to Z;'(Z;',)- 
Again, thank to the characteristics of the slow-flat quasi-static RFC-based environment in 
which these operations are involved among every MIMO channel matrix realization, 
conditions in (17) and full-rank of (18) are always satisfied. These issues are corroborated with 
the aid of the same MATLAB-based simulation framework used to validate full-rank 
properties over H and h. The statistical evolution for the determinants for W,, Z,), and ¢,, 
and the behavior of singularity within the Z,'(Z;,',) recursion are respectively illustrated in 
figures (6)-(8). MIMO(2,2), MIMO(4,2), and MIMO(4,4) were the MIMO communication link 
configurations considered for these tests. These simulation-driven outcomes provide 
supportive evidence for the proper functionality of the proposed iterative-recursive algorithm 
for computing Qt involving matrix sub-block inversions. On each figure, the statistical 
evolution for the determinants associated with Z,,W,,¢, and Zee) are respectively 
indicated by labels det(Zo), det(Wk), det(Fik), and det(iZk,iZkm1), while the light-blue zone at 
bottom delimited by a red-dotted line exhibits the gap which marks the avoidance in rank- 
deficincy over the involved matrices. The zero-determinant value is marked with a black circle. 


The next point of analysis for the behavior of the h* LPI-based iterative-recursive algorithm 
is complexity, which in essence will consist of a demand in matrix partitions (amount of 
matrix sub-blocks: PART) and arithmetic operations (amount of additions-subtractions: 
ADD-SUB; multiplications: MULT; and divisions: DIV). Let PART-mtx and ARITH-ops be 
the nomenclature for complexity cost related to matrix partitions and arithmetic operations, 
respectively. Without loss of generalization, define c[*] as the complexity in terms of the 


° Matrix structure given in (21) is directly derived from applying Equation (6), and by the use of Lemma 1 as 


4 “4 
(Z4 -y,w,"x,) = Ze, 4+ZY. (w, ey y,) 0 Gy Aig . See that this expansion is preferable instead of 


k-1°k kU k-17k 


=I - -1 1 1 =t =I 3 F : , 
(w, -XZ v,) =W, +W, KZ. -Y,W, x,) Y,W, , which is undesirable due to an unnecessary matrix 


kk-1 7k 


operation overhead related to computing Z e.g. inverting Ze , which comes preferably from the ZZ 3) 


k-17 


recursion. 
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costs PART-mtx and ARITH-ops belonging to operations involved in +. Henceforth, 
c{h* | =c[a7]+c[ an" | denotes the cost of computing h* as the sum of the costs of 
inverting Q and multiplying 47 by h’. It is evident that: a) c[ on" | implies PART=0 


and ARITH-ops itemized into MULT=8n,n,, ADD-SUB=4n,n (2n, -1), and DIV=0; b) 


ROT 


C [o"] =C [nen ]+c [(a"hy]. Clearly, c{h'h | demands no partitions at all, but with a 


ARITH-ops cost of MULT=8n,n°, and ADD-SUB=4(2n, -1)n7. However, the principal 


complexity relies critically on C [cn"hy" | , which is the backbone for h* , as presented in [22]. 


Table 2 summerizes these complexity results. For this treatment, C [ca"hy” | consists of 


n,-1 n,-1 n,-1 
3n, -2 partitions, MULT = >’C/+6, ADD-SUB = )°C/'+1, and DIV = )\C/"+1. The 
k=1 k=1 k=1 


ARITH-ops cost depends on C/, Ci’, and ci"; the constant factors for each one of these 


items are proper of the complexity presented in Cla, a . The remain of the complexities, i.e. 


IIT 


Cy Gand e 


particularly expressed in terms of 


, are calculated according to the iterative stpes defined in (18-20) and (21), 


c[ 9," ]+c[-a, J+c[-0,y,w,"]+c[a,] (22) 


It can be checked out that: a) no PART-mtx cost is required; b) the ARITH-ops cost employs 
(22) for each item, yielding: Cj = 40k? + 24k +12 (for MULT), Cl = 40k? +2 (for ADD_SUB), 
and C;" =2 (for DIV). 


An illustrative application example is given next. It considers a MIMO channel matrix 
realization obeying statistical behavior according to (1) and a MIMO(4,4) configuration: 


| 0.3059 + j0.7543 —0.8107 + j0.2082 0.2314 — j0.4892 —0.416 — 71.0189 

-1.1777 + j0.0419 0.8421 — j0.9448 = 0.1235 + j0.6067 =: 1.5437 + j0.4039 : 
H= d : d d eC with p(H)=4. Asa 
0.0886 — 0.0676 0.8409 + 70.5051 —-0.132 + j0.8867 —0.0964 — j0.2828 


| 0.2034 — j0.5886 0.0266 + j1.148 0.5132 — 71.1269 0.0806 + j0.4879 


consequence, 
[ 2.4516 -1.2671 0.1362 ~-2.7028 0 -1.9448 0.6022 -0.2002 | 
-1.2671 4.5832  -1.7292 1.3776 1.9448 0 -1.229 -2.4168 
0.1362 -1.7292 3.0132. 0.0913. -0.6022. 1.229 0 0.862 7 
_ |-2.7028 1.3776 0.0913 4.0913 0.2002 2.4168  -0.862 0 eR with p(Q) =8. 
0 1.9448  -0.6022 0.2002 2.4516 1.2671 0.1362 -2.7028 
-1.9448 0 1.229 2.4168 += -1.2671 4.5832. 1.7292 1.3776 
0.6022 -1.229 0 -0.862 0.1362 -1.7292 3.0132 0.0913 
| -0.2002  -2.4168 0.862 0 -2.7028 1.3776 0.0913 4.0913 | 
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Rank-determinant behavior: Zo, Wk, Fik, [iZk.iZkm1] 
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Figure 6. Statistical evolution of the rank-determinant behaviour concerning Zo, W, 7G, and ZZ 


fora mimo(2,2) configuration. 
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Rank-determinant behavior. Zo, Wk, Fik, [iZk,iZkm1] 
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Figure 7. Statistical evolution of the rank-determinant behaviour concerning Z, , W, , ¢,, and 244) 


fora mimo(4,2) configuration. 
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Figure 8. Statistical evolution of the rank-determinant behaviour concerning Z, , W, , ¢,, and Vie a 


fora mimo(4,4) configuration. 
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c[h | 
elas cia? 
c[O ch | c[a™n] 
PART-mtx cost 
° 0 [in PART 
operations] 
Snr Sng MULT 
ARITH-ops cost 
Angnt, (2m; —1)) 4(2n, -1) app-sues | [amouniol 
operations] 
0 0 DIV 


I 
Cc, 5 
8k? +8k+6 


8k? +8k 


8k? +8k+6 


16k? —4k 


Table 2. Complexity cost results of the LPI-based iterative-recursive algorithm for h” . 


Applying partition criteria (13-16) and given « =1:1:3, the following matrix sub-blocks are 
generated: 
2.4516 1.2671 
w= ’ 
1.2671 4.5832 
0.1362 -2.7028 0.1362 ~-1.7292 3.0132 0.0913 
X= ae a po F 
-1.7292 1.3776 —2.7028 1.3776 0.0913 4.0913 
3.0132 0.0913 
W, = j 


0.0913 4.0913 


-0.6022 0.2002 
-0.6022 1.2290 0 0.862 1.229 2.4168 
, % = , 
0 —0.862 
0.862 0 


0.2002 2.4168 -0.862 0 
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3 


1.7292 1.3776 1.9448 0 1.229 -2.4168 


2.4516 —1.2671 5 0.1362 —2.7028 0 -1.9448 0.6022 -0.2002 
1.2671 4.5832 |’? ; 


[ 0.1362 1.7292 | 
2.7028 1.3776 


0 1.9448 


and y, = . Suggested by (18-20), iterative operations (23-25) are computed as: 


6 = Wie Ze Yad Rip Qa Zt Zn (23) 
oy = W, - XZ Ye Ay = $'X,Z;", 0, = Zs + Za Naas (24) 
$3 =W3-X3Z3'Y3, @3=95'X3Z3', 0, = Z3' +Zz'Yaa3 (25) 


From (21), the matrix assignments related to recursion ZetZ) produces the following 
intermediate blockwise matrix results: 


1.5765 0.1235 -0.0307 1.0005 
oe o -—a. 0.1235 0.3332 0.1867 -0.0348 
) -Q?- 1 1 


-OYW" 6, -0.0307 0.1867 0.4432 -0.093 
1.0005 -0.0348 -0.093 0.9191 
-1 
ae oie i 
Dende 2: 2 
[ 0.4098 0.0879 -0.0829  -0.1839 -0.0743  -0.0775 ] 
0.0879 0.4355 -0.2847 0.3182 -0.0422 0.0985 
_| 0.0829 -0.2847 1.7642 0.3393 0.0012 1.0686 ; a (4 \=a7= 6 ~s 
-0.1839 -0.3182 0.3393 0.6023 «0.2376 ~—0.0548 -O,Y,W;' 0, 
0.0743 -0.0422 0.0012 0.2376 (0.4583 -0.0738 
| -0.0775 -0.0985 1.0686 0.0548 -0.0738 0.9499 | 
[ 1.9798 0.3808 -0.1114 1.0224 0 0.3605 0.2524 0.2183 7] 
0.3808 0.6759 0.2619 + 0.0856 «0.3605 =O 0.2368 0.1193 
-0.1114 0.2619 0.5493 -0.0218 -0.2524 0.2368 0 -0.0535 
1.02224 0.0856 -0.0218 0.9839 -0.2183 -0.1193 0.0535 0 |. This last recursive outcome 
“| 0 -0.3605 0.2524 -0.2183 1.9798 0.3808 -0.1114 1.0224 
0.3605 0  -0.2368 -0.1193 0.3808 0.6759 0.2619 (0.0856 
0.2524 0.2368 0 0.0535 -0.1114 0.2619 0.5493 -0.0218 
| 0.2183 0.1193 -0.0535 «= «0s 0.1114 0.0856 = -0.0218 0.9839 | 


from Z;,'(Z;',) corresponds to 4", and is further used for calculating h* ='h’ <R*®. 
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Moreover, notice that full-rank properties are always presented in matrices Z,,W,, W,,W,, 


<I og 4 
$,,0,%,Z,',Z5',and Z;'. 


7. VLSI implementation aspects 


The arithmetic operations presented in the algorithm for computing h* can be 
implemented under a modular-iterative fashion towards a VLSI (Very Large Scale of 
Integration) design. The partition strategy comprised in (13-16) provides modularity, while 
(18-20) is naturally associated with iterativeness; recursion is just used for constructing 
matrix-blocks in (21). Several well-studied aspects aid to implement a further VLSI 
architecture [23-27] given the nature of the mathematical structure of the algorithm. For 
instance, systolic arrays [25-27] are a suitable choice for efficient, parallel-processing 
architectures concerning matrix multiplications-additions. Bidimensional processing arrays 
are typical architectural outcomes, whose design consist basically in interconnecting 
processing elements (PE) among different array layers. The configuration of each PE comes 
from projection or linear mapping techniques [25-27] derived from multiplications and 
additions presented in (18-20). Also, systolic arrays tend to concurrently perform arithmetic 
operations dealing with the matrix concatenated multiplications Ge d. 1X Seay 
Zee: , and 6,Y,W," presented in (18-20). Consecutive additions inside every PE can be 
favourably implemented via Carry-Save-Adder (CSA) architectures [23-24], while 
multiplications may recur to Booth multipliers [23-24] in order to reduce latencies caused by 
adding acummulated partial products. Divisions presented in W, + Ze ,and ¢, T can be 
built through regular shift-and-subtract modules or classic serial-parallel subtractors [23-24]; 
in fact, CORDIC (Coordinate Rotate Digital Computer) processors [23] are also employed 
and configured in order to solve numerical divisions. The aforementioned architectural 
aspects offer an attractive and alternative framework for consolidating an ultimate VLSI 
design for implementing the h* algorithm without compromising the overall system data 
throughput (intrinsicly related to operation frequencies) for it. 


8. Conclusions 


This chapter presented the development of a novel iterative-recursive algorithm for 
computing a Left-Pseudoinverse (LPI) as a Generalized-Inverse for a MIMO channel matrix 
within a Rayleigh fading channel (RFC). The formulation of this algorithm consisted in the 
following step: i) first, structural properties for the MIMO channel matrix acquired 
permanent full-rank due to statistical properties of the RFC scenario; ii) second, Partition- 
Matrix Theory was applied allowing the generation of a block-matrix version of the MIMO 
channel matrix; iii) third, iterative addition-multiplication operations were applied at these 
matrix sub-blocks in order to construct blockwise sub-matrix inverses, and recursively 
reusing them for obtaining the LPI. For accomplishing this purpose, required mathematical 
background and MIMO systems concepts were provided for consolidating a solid scientific 
framework to understand the context of the problem this algorithm was attempting to solve. 


Partition-Matrix Theory Applied to the Computation of 
Generalized-Inverses for MIMO Systems in Rayleigh Fading Channels 


Proper functionality for this approach was validated through simulation-driven 
experiments, as well as providing an example of this operation. As an additional remark, 
some VLSI aspects and architectures were outlined for basically implementing arithmetic 
operations within the proposed LPI-based algorithm. 
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Chapter 8 


Operator Means and Applications 


Pattrawut Chansangiam 


Additional information is available at the end of the chapter 


http://dx.doi.org/10.5772/46479 


1. Introduction 


The theory of scalar means was developed since the ancient Greek by the Pythagoreans until 
the last century by many famous mathematicians. See the development of this subject in 
a survey article [24]. In Pythagorean school, various means are defined via the method 
of proportions (in fact, they are solutions of certain algebraic equations). The theory of 
matrix and operator means started from the presence of the notion of parallel sum as a tool 
for analyzing multi-port electrical networks in engineering; see [1]. Three classical means, 
namely, arithmetic mean, harmonic mean and geometric mean for matrices and operators are 
then considered, e.g., in [3, 4, 11, 12, 23]. These means play crucial roles in matrix and operator 
theory as tools for studying monotonicity and concavity of many interesting maps between 
algebras of operators; see the original idea in [3]. Another important mean in mathematics, 
namely the power mean, is considered in [6]. The parallel sum is characterized by certain 
properties in [22]. The parallel sum and these means share some common properties. This 
leads naturally to the definitions of the so-called connection and mean in a seminal paper [17]. 
This class of means cover many in-practice operator means. A major result of Kubo-Ando 
states that there are one-to-one correspondences between connections, operator monotone 
functions on the non-negative reals and finite Borel measures on the extended half-line. The 
mean theoretic approach has many applications in operator inequalities (see more information 
in Section 8), matrix and operator equations (see e.g. [2, 19]) and operator entropy. The concept 
of operator entropy plays an important role in mathematical physics. The relative operator 
entropy is defined in [13] for invertible positive operators A, B by 


S(A|B) = Al/* log(A71/2BA-1/2) Al/?, (1) 


In fact, this formula comes from the Kubo-Ando theory-S(-|-) is the connection corresponds 
to the operator monotone function t ++ log t. See more information in [7, Chapter IV] and its 
references. 


In this chapter, we treat the theory of operator means by weakening the original definition of 
connection in such a way that the same theory is obtained. Moreover, there is a one-to-one 
correspondence between connections and finite Borel measures on the unit interval. Each 
connection can be regarded as a weighed series of weighed harmonic means. Hence, every 
mean in Kubo-Ando’s sense corresponds to a probability Borel measure on the unit interval. 
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Various characterizations of means are obtained; one of them is a usual property of scalar 
mean, namely, the betweenness property. We provide some new properties of abstract 
operator connections, involving operator monotonicity and concavity, which include specific 
operator means as special cases. 


For benefits of readers, we provide the development of the theory of operator means. In 
Section 2, we setup basic notations and state some background about operator monotone 
functions which play important roles in the theory of operator means. In Section 3, we 
consider the parallel sum together with its physical interpretation in electrical circuits. 
The arithmetic mean, the geometric mean and the harmonic mean of positive operators 
are investigated and characterized in Section 4. The original definition of connection is 
improved in Section 5 in such a way that the same theory is obtained. In Section 6, several 
characterizations and examples of Kubo-Ando means are given. We provide some new 
properties of general operator connections, related to operator monotonicity and concavity, in 
Section 7. Many operator versions of classical inequalities are obtained via the mean-theoretic 
approach in Section 8. 


2. Preliminaries 


Throughout, let B(H) be the von Neumann algebra of bounded linear operators acting on a 
Hilbert space H. Let B(#1)* be the real vector space of self-adjoint operators on H. Equip 
B(H) with a natural partial order as follows. For A, B € B(H)*”, we write A < Bif B— Aisa 
positive operator. The notation T € B(H)* or T > 0 means that T is a positive operator. The 
case that T > 0 and T is invertible is denoted by T > 0 or T € B(#)T*. Unless otherwise 
stated, every limit in B(7) is taken in the strong-operator topology. Write A;, + A to indicate 
that A, converges strongly to A. If A, is a sequence in B(#)*", the expression A, | A means 
that Ay is a decreasing sequence and A, — A. Similarly, A, t A tells us that Ay is increasing 
and A, — A. We always reserve A,B,C, D for positive operators. The set of non-negative 
real numbers is denoted by RT. 


Remark 0.1. It is important to note that if Ay, is a decreasing sequence in B(H.)* such that 
Ayn > A, then A, — A if and only if (Anx,x) — (Ax,x) for all x € H. Note first that this 
sequence is convergent by the order completeness of B(#). For the sufficiency, if x € H, then 


I|(An — A)!/?x||? = ((An — A)!/?x, (An — A)!/?x) = ((An — A)x,x) + 0 
and hence ||(Ayn — A)x|| — 0. 
The spectrum of T € B(H) is defined by 
Sp(T) = {A € C: T—ALis not invertible}. 


Then Sp(T) is anonempty compact Hausdorff space. Denote by C(Sp(T)) the C*-algebra of 
continuous functions from Sp(T) to C. Let T € B(H) be anormal operator and z : Sp(T) > C 
the inclusion. Then there exists a unique unital *-homomorphism ¢ : C(Sp(T)) — B(H) such 
that $(z) = T, ie, 


¢ is linear 


° (3) = o(f)P(g) for all f,g € C(Sp(T)) 
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° $(f) = (p(f))* for all f € C(Sp(T)) 
© oH. 


Moreover, ¢ is isometric. We call the unique isometric *-homomorphism which sends f € 
C(Sp(T)) to P(f) € B(H) the continuous functional calculus of T. We write f(T) for p(f). 


Example 0.2. 1. If f(t) = a9 +a;t+-+-+ ant", then f(T) = ag] + a,T +--+ +ayT". 
2. If f(t) = £, then f(T) = (f) = $(2) = o(z)* =T* 


3. If f(t) = t'/? for t € R* and T > 0, then we define T!/* = f(T). Equivalently, T!/? is the 
unique positive square root of T. 


4. If f(t) = t-1/? for t > Oand T > 0, then we define T~!/* = f(T). Equivalently, T~!/ = 
ie = (T71)1/2, 


A continuous real-valued function f on an interval I is called an operator monotone function if 
one of the following equivalent conditions holds: 


(i) AX B => f(A) < f(B) for all Hermitian matrices A, B of all orders whose spectrums 
are contained in I; 


(ii) AX B => f(A) < f(B) for all Hermitian operators A,B € B(H) whose spectrums are 
contained in J and for an infinite dimensional Hilbert space H; 


(iii) A< B => f(A) < f(B) for all Hermitian operators A,B € B(H.) whose spectrums are 
contained in I and for all Hilbert spaces H. 


This concept is introduced in [20]; see also [7, 10, 15, 16]. Every operator monotone function is 
always continuously differentiable and monotone increasing. Here are examples of operator 
monotone functions: 

1) tre at+fonk, fora >O0andBeER, 

2) t+ -£-! on (0,00), 

3) tH (c- f\-* on (a,b), forc ¢ (a,b), 

4) t+ logt on (0,00), 

5) t+ (ft —1)/logt on Rt, where 0 +> Oand1+> 1. 

The next result is called the Lowner-Heinz’s inequality [20]. 

Theorem 0.3. For A,B € B(H)* andr € [0,1], if A < B, then A" < B". That is the map t ++ t" is 


an operator monotone function on Rt for any r € [0,1]. 


A key result about operator monotone functions is that there is a one-to-one correspondence 
between nonnegative operator monotone functions on IR* and finite Borel measures on [0, co] 
via integral representations. We give a variation of this result in the next proposition. 


Proposition 0.4. A continuous function f : R* — IR¢ is operator monotone if and only if there 
exists a finite Borel measure 1 on [0,1] such that 


f(x) = [oy Lee, ERY v) 
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Here, the weighed harmonic mean !; is defined for a,b > 0 by 
alb = [(1—t)a-'+#b-4]1 (3) 


and extended to a,b > O by continuity. Moreover, the measure is unique. Hence, there is a 
one-to-one correspondence between operator monotone functions on the non-negative reals and finite 
Borel measures on the unit interval. 


Proof. Recall that a continuous function f : Rt — IR* is operator monotone if and only if 
there exists a unique finite Borel measure v on [0,00] such that 


f(z) = f 


i de xéRt 


where 


A+1 
gx (A) = wae) forA>0, x(0)=1, (oc) =x. 


Consider the Borel measurable function  : [0,1] — [0,00],t 4 4;. Then, for each x € Rt, 


i px (A) dv(A) = he x © w(t) dv(t) 


x 


~ fol) x—xt+t ave) 


= 1!,xdv(t). 
Joy rev 
Now, set pt = vp. Since pp is bijective, there is a one-to-one corresponsence between the finite 


Borel measures on [0,00] of the form v and the finite Borel measures on [0,1] of the form vy. 
The map f ++ p is clearly well-defined and bijective. 


3. Parallel sum: A notion from electrical networks 


In connections with electrical engineering, Anderson and Duffin [1] defined the parallel sum of 
two positive definite matrices A and B by 


A:B=(A'+B1)1, (4) 


The impedance of an electrical network can be represented by a positive (semi)definite 
matrix. If A and B are impedance matrices of multi-port networks, then the parallel sum 
A : B indicates the total impedance of two electrical networks connected in parallel. This 
notion plays a crucial role for analyzing multi-port electrical networks because many physical 
interpretations of electrical circuits can be viewed in a form involving parallel sums. This is 
a starting point of the study of matrix and operator means. This notion can be extended to 
invertible positive operators by the same formula. 


Lemma 0.5. Let A,B,C,D,An,Bn € B(H)** foralln € N. 


(1) If An | A, then Az! + Aq}. If An t A, then Az! | Aut. 
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(2) IfA<CandB<D,thnA:B<C:D. 
(3) If An | Aand By | B, then An: By | A: B. 
(4) If An | Aand B, | B, then lim Ay : By exists and does not depend on the choices of Ay, Bn. 


Proof. (1) Assume Ay | A. Then A,, lis increasing and, for each x € H, 
((Aq — A7*)x,x) = ((A— An)A7'x, Ag*x) < ||(A— An)A7*x||] An Illlxl] > 0. 


(2) Follow from (1). 


(3) Let An, Bn € B(H)+* be such that A, | A and By | A where A,B > 0. Then Az! t Aq! 
and B,, | + B~!. So, A; ! + B;,! is an increasing sequence in B(H.)* such that 


Aap SA ae, 


ie. A;!+B,!t A714 B71. By (1), we thus have (A;!+B,!)~-! | (A7'+ B-4)71. 


(4) Let An, By € B(H)** be such that A, | A and B, | B. Then, by (2), An : By is a decreasing 
sequence of positive operators. The order completeness of B(H) guaruntees the existence of 
the strong limit of Ay : By. Let Aj, and By, be another sequences such that Aj, | A and Bi, | B. 
Note that for each n,m € IN, we have An < An + Aj, — A and By < By + By, — B. Then 

An: Bn < (An + Al, — A): (Bn + Bi, —B). 


Note that as n + co, A, + Ai, — A — Al, and B, + Bi, — B > Bi. We have that as n — 00, 


(An + Aj, — A) : (Bn + Bi, — B) + Al, : By 


Hence, limy—+soo An : By < Al, : Bl, and limy—yoo An : Bu < limy—co Al, : Bi,. By symmetry, 
limyn— oo An : Bn > limn—00 Ah, : Bi, 


We define the parallel sum of A, B > 0 to be 


A:B= lim(A + el) : (B+el) (5) 


where the limit is taken in the strong-operator topology. 


Lemma 0.6. For each x € H, 


((A: B)x,x) = inf{(Ay,y) + (Bz,z) :y,z © H,y +z = x}. (6) 


Proof. First, assume that A, B are invertible. Then for all x,y € H, 


(Ay, y) + (B(x — y),x —y) — ((A : B)x,x) 

= (Ay,y) + (Bx, x) — 2Re(Bx, y) + (By, y) — ((B — B(A + B)!B)x, x) 
= ((A+B)y,y) — 2Re(Bx,y) + ((A + B) ‘Bx, Bx) 

= ||(A+B)!/7y|)? — 2Re(Bx,y) + ||(A + B)1/?Bx||? 

>0. 
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With y = (A + B)~!Bx, we have 


(Ay,y) + (B(x — y),x—y) — (A: B)x,x) =0. 


Hence, we have the claim for A,B > 0. For A,B > 0, consider A+ eI and B + eI where 
e | 0. 


Remark 0.7. This lemma has a physical interpretation, called the Maxwell’s minimum power 
principle. Recall that a positive operator represents the impedance of a electrical network while 
the power dissipation of network with impedance A and current x is the inner product (Ax, x). 
Consider two electrical networks connected in parallel. For a given current input x, the current 
will divide x = y +z, where y and z are currents of each network, in such a way that the power 
dissipation is minimum. 


Theorem 0.8. The parallel sum satisfies 

(1) monotonicity: Ay < Az, By, < By => Ay: By < Az: Bo. 

(2) transformer inequality: S*(A : B)S < (S*AS) : (S*BS) for every S € B(H). 
(3) continuity from above: if Ay, | Aand By, | B, then Ayn: By | A: B. 


Proof. (1) The monotonicity follows from the formula (5) and Lemma 0.5(2). 
(2) For each x,y,z € H such that x = y + z, by the previous lemma, 

(S*(A : B)Sx,x) = ((A: B)Sx, Sx) 
< (ASy, Sy) + (S* BSz,z) 
= (S* ASy,y) + (S* BSz,z). 
Again, the previous lemma assures S*(A : B)S < (S*AS) : (S*BS). 


(3) Let A, and B, be decreasing sequences in B(H.)* such that A, | A and B, | B. Then 
An : By is decreasing and A: B < Ay : By for alln € IN. We have that, by the joint 
monotonicity of parallel sum, for all e > 0 


An? By < (Ap +ed) 2 (By, tel): 


Since A, + eI | A+eland B, + e€1 | B+ el, by Lemma 3.1.4(3) we have A; : By, | A: B. 


Remark 0.9. The positive operator S* AS represents the impedance of a network connected 
to a transformer. The transformer inequality means that the impedance of parallel connection 
with transformer first is greater than that with transformer last. 


Proposition 0.10. The set of positive operators on H is a partially ordered commutative semigroup 
with respect to the parallel sum. 


Proof. For A,B,C > 0, we have (A: B): C = A: (B:C) and A: B=B: A. The continuity 
from above in Theorem 0.8 implies that (A : B): C= A:(B:C)andA:B=B: A forall 
A,B,C > 0. The monotonicity of the parallel sum means that the positive operators form a 
partially ordered semigroup. 
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Theorem 0.11. For A,B,C,D > 0, we have the series-parallel inequality 
(A+B):(C+D)>A:C+B:D. (7) 


In other words, the parallel sum is concave. 


Proof. For each x,y,z € H such that x = y + z, we have by the previous lemma that 
((A:C+B:D)x,x) = ((A:C)x,x) + ((B: D)x,x) 


< (Ay, y) + (Cz,z) + (By, y) + (Dz, z) 
= ((A+ B)y,y) +((C + D)z,z). 


Applying the previous lemma yields (A+ B):(C+D)>A:C+B:D. 


Remark 0.12. The ordinary sum of operators represents the total impedance of two networks 
with series connection while the parallel sum indicates the total impedance of two networks 
with parallel connection. So, the series-parallel inequality means that the impedance of a 
series-parallel connection is greater than that of a parallel-series connection. 


4, Classical means: arithmetic, harmonic and geometric means 


Some desired properties of any object that is called a “mean” M on B(#)* should have are 
given here. 


(A8). 
(A9 


betweenness: if A < B, then A < M(A,B) < B; 
. fixed point property: M(A, A) = A. 


(Al). positivity: A,B > 0 = M(A,B) > 0; 
(A2). monotonicity: A > A',B > B' = M(A,B) > M(A',B’); 
(A3). positive homogeneity: M(kA,kB) = kM(A, B) for k € R*; 
(A4). transformer inequality: X* M(A, B)X < M(X* AX, X*BX) for X € B(H); 
(A5). congruence invariance: X* M(A, B)X = M(X* AX, X* BX) for invertible X € B(H); 
(A6). concavity: M(tA + (1—#)B,tA’ + (1—#)B’) > tM(A, A’) + (1—t)M(B, B’) fort € [0,1]; 
(A7). continuity from above: if Ay, | A and B, | B, then M(Ay, By) | M(A,B); 
) 
) 


In order to study matrix or operator means in general, the first step is to consider three classical 
means in mathematics, namely, arithmetic, geometric and harmonic means. 


The arithmetic mean of A,B € B(H)* is defined by 
1 
AVB=5(A+B). (8) 


Then the arithmetic mean satisfies the properties (A1)-(A9). In fact, the properties (A5) and 
(A6) can be replaced by a stronger condition: 


X* M(A, B)X = M(X* AX, X*BX) for all X € B(H). 
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Moreover, the arithmetic mean satisfies 
affinity: M(kA +C,kB+C) =kM(A,B)+C fork € Rt. 
Define the harmonic mean of positive operators A,B € B(H)* by 


A!B=2(A:B) = lim2(Az* + Be") ©) 
€. 


where Ag = A+elI and Be = B+el. Then the harmonic mean satisfies the properties 
(A1)-(A9). 


The geometric mean of matrices is defined in [23] and studied in details in [3]. A usage 
of congruence transformations for treating geometric means is given in [18]. For a given 
invertible operator C € B(H), define 


Tc: B(H)™ > B(H)*, A + CAC. 


Then each Tc is a linear isomorphism with inverse [c-1 and is called a congruence 
transformation. The set of congruence transformations is a group under multiplication. Each 
congruence transformation preserves positivity, invertibility and, hence, strictly positivity. In 
fact, 'c maps B(#H)* and B(H)t* onto themselves. Note also that Ic is order-preserving. 


Define the geometric mean of A,B > 0 by 
A#B = AM?(A MBA M2) 12 Al? = Tain oT 4, (B). (10) 


Then A#B > 0 for A,B > 0. This formula comes from two natural requirements: This 
definition should coincide with the usual geometric mean in R+: A#B = (AB)!/? provided 
that AB = BA. The second condition is that, for any invertible T € B(H), 


T*(A#B)T = (T*AT)#(T*BT). (11) 


The next theorem characterizes the geometric mean of A and B in term of the solution of a 
certain operator equation. 


Theorem 0.13. For each A,B > 0, the Riccati equation Tx(A7!) := XA-!X = Bhasa unique 
positive solution, namely, X = A#B. 


Proof. The direct computation shows that (A#B)A~!(A#B) = B. Suppose there is another 
positive solution Y > 0. Then 


(A-V2xA-1/2)2 — A V/2y¥A 1XA /2_ 4 1/2y4 1lyA V2 = (AO ay Ae 


The uniqueness of positive square roots implies that AW2xXA-V/2 = A-l/2yg-l/2 ie, 
X=Y. 


Theorem 0.14 (Maximum property of geometric mean). For A,B > 0, 
A#B=max{X >0: XA!X<B} (12) 


where the maximum is taken with respect to the positive semidefinite ordering. 
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Proof. If XA~!X < B, then 


and A~M/?2XA-1/2 < (A71/2BA-1/2)1/2 ie. X < A#B by Theorem 0.3. 


Recall the fact that if f : [a,b] + C is continuous and Ay; — A with Sp(An) C [a,b] for all 
n € NN, then Sp(A) C [a,b] and f(An) > f(A). 


Lemma 0.15. Let A,B,C,D,An,By € B(H)** foralln EN. 
(1) IfA< CandB<D, then A#B < C#D. 


(2) If An | Aand By | B, then An#Byn | A#B. 
(3) If An | Aand By, | B, then lim Ay # By exists and does not depend on the choices of Ay, Bn. 


Proof. (1) The extremal characterization allows us to prove only that (A#B)C~!(A#B) < D. 
Indeed, 

=B 

<D. 


(2) Assume Ayn | A and By, | B. Then A, #B, is a decreasing sequence of strictly positive 
operators which is bounded below by 0. The order completeness of B(7) implies that this 
sequence converges strongly to a positive operator. Since A; ! < A~!, the Lowner-Heinz’s 
inequality assures that A; !/? < A~!/2 and hence ||Az!/*|| < ||A~1/2|| for all n € N. Note 
also that ||Bu|| < ||Bi|| for all n € IN. Recall that the multiplication is jointly continuous 
in the strong-operator topology if the first variable is bounded in norm. So, Ay tf +B Ag? 2 
converges strongly to A~'/2BA~1/2, It follows that 


(Ao 3A eye Ss (AS SpA 
Since Al/? is norm-bounded by || A!/2|| by Lowner-Heinz’s inequality, we conclude that 


Alla A AB A he Ale = AV2(A-V2BAV2y1/2 Al/2_ 


The proof of (3) is just the same as the case of harmonic mean. 


We define the geometric mean of A, B > 0 by 


A#B =lim(A + el) #(B + el). (13) 
€. 


Then A#B > 0 for any A,B > 0. 
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Theorem 0.16. The geometric mean enjoys the following properties 

(1) monotonicity: Ay < Az, By, < By > A, #By < Ap # Bo. 

(2) continuity from above: Ay | A, By | B => An#Byn | A#B. 

(3) fixed point property: A# A = A. 

(4) self-duality: (A#B)~! = A~'#B-1. 

(5) symmetry: A#B = B#A. 

(6) congruence invariance: Tc (A)#Tc(B) =Tc(A#B) for all invertible C. 


Proof. (1) Use the formula (13) and Lemma 0.15 (1). 
(2) Follows from Lemma 0.15 and the definition of the geometric mean. 
(3) The unique positive solution to the equation XA1X=AisX =A. 


(4) The unique positive solution to the equation X-!4-1X-! = Bis X-! = A#B. But this 
equstion is equivalent to XAX = B~!. So, A~'#B-! = X = (A#B)71. 


(5) The equation XA~!X = B has the same solution to the equation XB “1X =A by taking 
inverse in both sides. 


(6) We have 
T¢(A#B)(Pc(A))“c(A#B) = Tc(A#B)c-1(A!)Fc(A#B) 
=Tc¢((A#B)A~!(A#B)) 
=Tc(B). 
Then apply Theorem 0.13. 


The congruence invariance asserts that Ic is an isomorphism on B(H.)** with respect to the 
operation of taking the geometric mean. 


Lemma 0.17. For A > 0 and B > 0, the operator 


(é5) 


is positive if and only if B— C* A~!C is positive, i.e., B > C*A71C. 


_ f{1-A-'c 
x=(0°4°). 


eee © es I O\f(AC)\  fI-A-!C 
X\c*B}] \-ctAt ri} \crB/\o 1 
_(A 0 
~\o B—cCtAic )° 


Since Ig preserves positivity, we obtain the desired result. 


Proof. By setting 


we compute 
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Theorem 0.18. The geometric mean A#B of A,B € B(H)* is the largest operator X € B(H)** for 
which the operator 
A X 
a) (14) 
is positive. 
Proof. By continuity argumeny, we may assume that A, B > 0. If X = A#B, then the operator 


(14) is positive by Lemma 0.17. Let X € B(#)** be such that the operator (14) is positive. Then 
Lemma 0.17 again implies that XA-!X < Band 


(A-M2xA-V2)2 = AV2x AXA? < AV eB AV? 


The Léwner-Heinz’s inequality forces A-Wex A? < (ABA). Now, applying 
T giz yields X < A#B. 


Remark 0.19. The arithmetric mean and the harmonic mean can be easily defined for 
multivariable positive operators. The case of geometric mean is not easy, even for the 
case of matrices. Many authors tried to defined geometric means for multivariable positive 
semidefinite matrices but there is no satisfactory definition until 2004 in [5]. 


5. Operator connections 


We see that the arithmetic, harmonic and geometric means share the properties (A1)-(A9) in 
common. A mean in general should have algebraic, order and topological properties. Kubo 
and Ando [17] proposed the following definition: 


Definition 0.20. A connection on B(H)* is a binary operation 7 on B(H)* satisfying the 

following axioms for all A, A’, B, B’,C € B(H)*: 

(M1) monotonicity: A< A',B < B! => AoB< A'oB' 

(M2) transformer inequality: C(Ag B)C < (CAC) a (CBC) 

(M3) joint continuity from above: if An, Bn € B(H)* satisfy An | A and By | B, then Ano Bn | 
AcB. 

The term “connection" comes from the study of electrical network connections. 


Example 0.21. The following are examples of connections: 


the left trivial mean (A, B) +> A and the right trivial mean (A,B) ++ B 
the sum (A,B) > A+B 
the parallel sum 


arithmetic, geometric and harmonic means 


V7 PF wey 


the weighed arithmetic mean with weight « € [0,1] which is defined for each A,B > 0 by 
AV B=(1—a)A+aB 


6. the weighed harmonic mean with weight « € [0,1] which is defined for each A,B > 0 by 
A!y B = [(1—«#)A~! +B~']~! and extended to the case A, B > 0 by continuity. 
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From now on, assume dim H = oo. Consider the following property: 


(M3’) separate continuity from above: if An, By, € B(H)t satisfy A, | A and B, | B, then 
A,oB\|AoBandAcB, | AcB. 


The condition (M3’) is clearly weaker than (M3). The next theorem asserts that we can improve 
the definition of Kubo-Ando by replacing (M3) with (M3’) and still get the same theory. This 
theorem also provides an easier way for checking a binary opertion to be a connection. 


Theorem 0.22. If a binary operation 0 on B(H)* satisfies (M1), (M2) and (M3’), then o satisfies 
(M3), that is, 0 is a connection. 


Denote by OM(IR*) the set of operator monotone functions from IR* to R*. If a binary 
operation o has a property (A), we write 7 € BO(A). The following properties for a binary 
operation ¢ and a function f : R*+ > R* play important roles: 


(P) : Ifa projection P € B(#)+ commutes with A, B € B(H)*, then 
P(A B) = (PA) o (PB) = (Ac B)P; 
(F) : f(t)I = Io (tI) for any ft € R*. 
Proposition 0.23. The transformer inequality (M2) implies 
¢ Congruence invariance: For A,B > 0 and C > 0, C(AoB)C = (CAC) a (CBC); 


¢ Positive homogeneity: For A,B > 0 and a € (0,00), a(AoaB) = (aA) o (wB). 


Proof. For A,B > 0 and C > 0, we have 


c-l(CAC) @ (cBc)|c" < (c-'cacc")o (c-tcBcc!) = AcB 


and hence (CAC)a (CBC) < C(AoaB)C. The positive homogeneity comes from the 
congruence invariance by setting C = /al. 


Lemma 0.24. Let f : Rt — R* be an increasing function. If 0 satisfies the positive homogeneity, 
(M3’) and (F), then f is continuous. 


Proof. To show that f is right continuous at each t € Rt, consider a sequence ft, in Rt such 
that t, | t. Then by (M3’) 


f(tn) =lot,l | Lotl = f(t)L, 


ie. f(tn) | f(t). To show that f is left continuous at each t > 0, consider a sequence ty > 0 
such that t, + t. Then ti! { tl and 


lim, 1 f (tn )I = limit, (Iot,I) = lim(t, {ol = (tT) ol 
=t (Ietl) =t“f(t)I 


Since f is increasing, t; | f (tn) is decreasing. So, t+ t~! f(t) and f are left continuous. 
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Lemma 0.25. Let ¢ be a binary operation on B(H.)* satisfying (M3’) and (P). If f :IR* + R* isan 
increasing continuous function such that o and f satisfy (F), then f(A) = Io A for any A € B(H)?. 


Proof. First consider A € B(H)* in the form 1)", A;P; where {P;}', is an orthogonal family 
of projections with sum I and A; > 0 for alli = 1,...,m. Since each P; commutes with A, we 
have by the property (P) that 


IgA=) P(IcA)=) BoPA=) Poa,P; 
=) Buea =) fOjR =f A). 


Now, consider A € B(H)*. Then there is a sequence A, of strictly positive operators in the 
above form such that A, | A. Then Io A, | Io A and f(A;,) converges strongly to f(A). 
Hence, 1a A = lim Ig Ay = lim f(An) = f(A). 


Proof of Theorem 0.22: Leto € BO(M1, M2, M3’). As in [17], the conditions (M1) and (M2) 
imply that ¢ satisfies (P) and there is a function f : Rt — IRt subject to (F). If0 < th < to, 
then by (M1) 


f(t) = Io (tl) < Io (t2l) = f (to) 1, 


ie. f(ti) < f(t). The assumption (M3’) is enough to guarantee that f is continuous by 
Lemma 0.24. Then Lemma 0.25 results in f(A) = IA for all A > 0. Now, (M1) and the fact 
that dim H = oo yield that f is operator monotone. If there is another g € OM(IR*) satisfying 
(F), then f(t)I = Iotl = g(t)I for eacht > 0,ie. f = g. Thus, we establish a well-defined 
map 7 € BO(M1, M2, M3’) ++ f € OM(IR*) such that ¢ and f satisfy (F). 


Now, given f € OM(IR*), we construct 7 from the integral representation (2) in Proposition 
0.4. Define a binary operation 7 : B(H)* x B(H)* — B(H)t by 


= ! 
AoB [i An Bane) (15) 


where the integral is taken in the sense of Bochner. Consider A, B € B(H)* and set F; = A!; B 
for each t € [0,1]. Since A < ||A||I and B < ||B||1, we get 


AI B I 
Al] + 1 — #) 1B] 


A'NB< |All te BIT 
By Banach-Steinhaus’ theorem, there is an M > 0 such that ||F;|| < M for all t € [0,1]. Hence, 
Filldu(t) < [ Mdu(t) <o. 
Jilin < [Mau 


So, F; is Bochner integrable. Since F; > 0 for all t € [0,1], Sio3) Fy du(t) > 0. Thus, Ag Bisa 
well-defined element in B(H)*. The monotonicity (M1) and the transformer inequality (M2) 
come from passing the monotonicity and the transformer inequality of the weighed harmonic 
mean through the Bochner integral. To show (M3’), let A; | A and B, | B. Then An!;B | 
A |; B for t € [0,1] by the monotonicity and the separate continuity from above of the weighed 
harmonic mean. Let ¢ € H. Define a bounded linear map ® : B(H.) — C by ®(T) = (TG,¢). 
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For each n € N, set T,(t) = An!;B and put T(t) = A!;B. Then for each n € NU {oo}, 
®@ o T, is Bochner integrable and 


(f a(t) an(6,e) = Cf T(t) au(t)) = f & 0 Tu(t) dul) 


Since Ty(t) | Too(t), we have that (T,(t)Z,¢) — (Too(t)Z,é) as n — oo for each t € [0,1]. We 
obtain from the dominated convergence theorem that 


Jim (Ano B)E,2) = Jim, f Ta(#) dn(8)2,8) 


Tim, f (Tn(t)¢, 6) du(t) 


| (Teo(t)6,€) du(t) 


=f Teo(#)du(0)8,2) 


= ((AcB)é,¢). 


So, Ano B | AaB. Similarly, Ao B, | Ao B. Thus, ¢ satisfies (M3’). It is easy to see that 
f(t)I = Io (#1) for t > 0. This shows that the map o +> f is surjective. 


To show the injectivity of this map, let 71,02 € BO(M1, M2, M3’) be such that 0; ++ f where, 
for each t > 0, Io; (tl) = f(t)I,i = 1,2. Since 0; satisfies the property (P), we have Ig; A = 
f(A) for A > 0 by Lemma 0.25. Since g; satisfies the congruence invariance, we have that for 
A>OandB>0, 
Ag, B = Al/?(I1g, A-/27BA V2) AV? = AV ¢(A-V2BA-V/2)Al/2) j= 1,2. 
For each A, B > 0, we obtain by (M3’) that 
Ao B=limA, 0 B 
eL0 
= lim All* (hay AD *BAL V2) All 
€. 
= lim Ag? f(Agt? BAL?) Ae? 
€. 
= lim AU Ta A VARA) Al 
€. 


= lim Ae 0 B 
el0 


= AooB, 


where Ac = A+elI. That is 7; = 05. Therefore, there is a bijection between OM(IRT) and 
BO(M1, M2, M3’). Every element in BO(M1, M2, M3’) admits an integral representation (15). 
Since the weighed harmonic mean possesses the joint continuity (M3), so is any element in 
BO(M1, M2, M3’). 


The next theorem is a fundamental result of [17]. 
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Theorem 0.26. There is a one-to-one correspondence between connections 0 and operator monotone 
functions f on the non-negative reals satisfying 


f(tj)I=Io(tl, teR. (16) 
There is a one-to-one correspondence between connections o and finite Borel measures v on [0,00] 
satisfying 
AcB ay PHA: B)du(t), A,B >0. (17) 
0,00 


Moreover, the map o ++ f is an affine order-isomorphism between connections and non-negative 
operator monotone functions on R*. Here, the order-isomorphism means that when 0; +> fi for 
i=1,2,A0,B< Ao>2B forall A,B € B(H)* ifand only if f, < fr. 


Each connection 7 on B(#)* produces a unique scalar function on IR*, denoted by the same 
notation, satisfying 

(sot)I =(sI)o (tl), s,f£€R™. (18) 
Let s,t € Rt. Ifs > 0, thensot = sf(t/s). Ift > 0,thensct = tf(s/t). 
Theorem 0.27. There is a one-to-one correspondence between connections and finite Borel measures 
on the unit interval. In fact, every connection takes the form 


AcB= | AuBdu(t), A,B >0 (19) 
0,1 


for some finite Borel measure 1 on [0,1]. Moreover, the map jt ++ @ is affine and order-preserving. 
Here, the order-presering means that when 1; > 0; (i=1,2), if w4(E) < p2(E) for all Borel sets E in 
[0,1], then Ao, B < AoyB forall A,B € B(H)™. 


Proof. The proof of the first part is contained in the proof of Theorem 0.22. This map is affine 
because of the linearity of the map +> f f dp on the set of finite positive measures and the 
bijective correspondence between connections and Borel measures. It is straight forward to 
show that this map is order-preserving. 


Remark 0.28. Let us consider operator connections from electrical circuit viewpoint. A 
general connection represents a formulation of making a new impedance from two given 
impedances. The integral representation (19) shows that such a formulation can be described 
as a weighed series connection of (infinite) weighed harmonic means. From this point of 
view, the theory of operator connections can be regarded as a mathematical theory of electrical 
circuits. 


Definition 0.29. Let 7 be a connection. The operator monotone function f in (16) is called the 
representing function of o. If ¢ is the measure corresponds to 7 in Theorem 0.27, the measure 
yy! that takes a Borel set E in [0, 00] to (1 (E)) is called the representing measure of o in the 
Kubo-Ando’s theory. Here,  : [0,1] — [0,00] is a homeomorphism tf +> t/(1—f). 


Since every connection ¢ has an integral representation (19), properties of weighed harmonic 
means reflect properties of a general connection. Hence, every connection o satisfies the 
following properties for all A, B > 0,T € B(H) and invertible X € B(H): 
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¢ transformer inequality: T*(Ag B)T < (T* AT) 0 (T*BT); 
* congruence invariance: X*(Ag B)X = (X* AX) a (X*BX); 
* concavity: (tA + (1—t)B) a (tA! + (1—#)B’) > t(Ao A’) + (1—#)(BoB’) fort € [0,1]. 


Moreover, if A, B > 0, 
AoB=A'/?¢(A-V2BA-V/2) Ali? (20) 
and, in general, for each A, B > 0, 


AogB=limAco Be (21) 
el0 


where Ace = A+ el and Be = B + el. These properties are useful tools for deriving operator 
inequalities involving connections. The formulas (20) and (21) give a way for computing the 
formula of connection from its representing function. 


Example 0.30. 1. The left- and the right-trivial means have representing functions given by 
t++ land t+-+ t, respectively. The representing measures of the left- and the right-trivial 
means are given respectively by d9 and do where 6; is the Dirac measure at x. So, the 
a-weighed arithmetic mean has the representing function f +> (1—«) + at and it has 
(1 — )do + adco as the representing measure. 

2. The geometric mean has the representing function ft > f1/2, 

3. The harmonic mean has the representing function ft +> 2t/(1+t) while f + t/(1+f) 
corrsponds to the parallel sum. 


Remark 0.31. The map 7 ++ wp, where yp is the representing measure of 7, is not 
order-preserving in general. Indeed, the representing measure of V is given by pw = (69 + 
5o0)/2 while the representing measure of ! is given by 5;. We have ! < V but 61 £ HL. 


6. Operator means 


According to [24], a (scalar) mean is a binary operation M on (0,00) such that M(s,t) lies 
between s and ¢ for any s,t > 0. For a connection, this property is equivalent to various 
properties in the next theorem. 


Theorem 0.32. The following are equivalent for a connection o on B(H)t: 


(i) o satisfies the betweenness property, i.e. A<B>A<ACB<B. 

(ii) o satisfies the fixed point property, ie. Ac A = A forall A € B(H)*. 
(iti) o is normalized, i.e. Io I = I. 

(iv) the representing function f of o is normalized, i.e. f(1) = 1. 

(v) the representing measure yt of 7 is normalized, i.e. pt is a probability measure. 


Proof. Clearly, (i) = (iii) > (iv). The implication (iii) = (ii) follows from the congruence 
invariance and the continuity from above of 7. The monotonicity of 7 is used to prove (ii) > 
(i). Since 


lol= i= I Ldy(t) = (0, Y])L, 
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we obtain that (iv) > (v) = (iii). 
Definition 0.33. A mean is a connection satisfying one, and thus all, of the properties in the 
previous theorem. 

Hence, every mean in Kubo-Ando’s sense satisfies the desired properties (A1)—(A9) in Section 
3. As a consequence of Theorem 0.32, a convex combination of means is a mean. 

Theorem 0.34. Given a Hilbert space H, there exist affine bijections between any pair of the following 
objects: 

(i) the means on B(H)*, 

(ii) the operator monotone functions f : Rt —> IR* such that f(1) = 1, 

(iii) the probability Borel measures on [0, 1]. 


Moreover, these correspondences between (i) and (ti) are order isomorphic. Hence, there exists an affine 
order isomorphism between the means on the positive operators acting on different Hilbert spaces. 


Proof. Follow from Theorems 0.27 and 0.32. 


Example 0.35. The left- and right-trivial means, weighed arithmetic means, the geometric 
mean and the harmonic mean are means. The parallel sum is not a mean since its representing 
function is not normalized. 


Example 0.36. The function ¢ +> t* is an operator monotone function on R* for each a € [0,1] 
by the Léwner-Heinz’s inequality. So it produces a mean, denoted by #,, on B(H)*. By the 
direct computation, 


s#y t= sh, (22) 


ie. #, is the a-weighed geometric mean on Rt. So the a-weighed geometric mean on R* is 
really a Kubo-Ando mean. The a-weighed geometric mean on B(H)* is defined to be the mean 
corresponding to that mean on IR™. Since f* has an integral expression 


_ sina7 / tA¢—1 
0 


tu 
t+A 


~ dm(A) (23) 


(see [7]) where m denotes the Lebesgue measure, the representing measure of #, is given by 


_ sin a7 As-l 


du(A) = dm(A). (24) 


Example 0.37. Consider the operator monotone function 


t 
—— > ‘ 
sear ea t>0,a € [0,1] 


The direct computation shows that 


oe { ((1—a)s—1 + at!)—1, s,t > 0; 
slyt= 


0, otherwise, (25) 
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which is the a-weighed harmonic mean. We define the «-weighed harmonic mean on B(H)* to 
be the mean corresponding to this operator monotone function. 


Example 0.38. Consider the operator monotone function f(t) = (f—1)/logt fort > 0,f 41, 
f(0) = Oand f(1) = 1. Then it gives rise to a mean, denoted by A, on B(H)*. By the direct 


computation, 
s—t ; 
Jogsalogt S > E> 0,8 At; 
sAt= S, s=t (26) 
0, otherwise, 


i.e. A is the logarithmic mean on Rt. So the logarithmic mean on Rt is really a mean in 
Kubo-Ando’s sense. The logarithmic mean on B(H.)* is defined to be the mean corresponding 
to this operator monotone function. 


Example 0.39. The map ft +> (t’ + f'~")/2 is operator monotone for any r € [0,1]. This 


function produces a mean on B(H.)*. The computation shows that 
gti-r 4 gl-ryr 
(s,£) is 


However, the corresponding mean on B(H)* is not given by the formula 


A’Bi-r a A!l-'Br 


(A,B) KH 5 


(27) 


since it is not a binary operation on B(#.)*. In fact, the formula (27) is considered in [8], called 
the Heinz mean of A and B. 


Example 0.40. For each p € [—1,1] and @ € [0,1], the map 
fry [(1—a) + atP]}/P 


is an operator monotone function on R*. Here, the case p = 0 is understood that we take 
limit as p + 0. Then 


s#at = ((1—a)s? + at? ]l/?. (28) 


The corresponding mean on B(#)* is called the quasi-arithmetic power mean with parameter 
(p, a), defined for A > 0 and B > 0 by 


A#paB = AM2((1—a)1+0(A-M/2BAV2)P]I/P Al/2, (29) 


The class of quasi-arithmetic power means contain many kinds of means: The mean #},, is the 
«-weighed arithmetic mean. The case #94 is the a-weighed geometric mean. The case #_1,, is 
the a-weighed harmonic mean. The mean #, 1/2 is the power mean or binomial mean of order p. 
These means satisfy the property that 


AfipaB = Biya» A. (30) 
Moreover, they are interpolated in the sense that for all p,q,a € [0,1], 


(A#r,p B) Hr a (A Hq B) = A#, (1—w)ptaq B. (31) 
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Example 0.41. If 71,02 are means such that 0; < 0, then there is a family of means that 
interpolates between o; and 07, namely, (1 — «)o1 + «0 for all w € [0,1]. Note that the map 
a +» (1—«)o1 + ao is increasing. For instance, the Heron mean with weight « € [0,1] is 
defined to be hy = (1—a)#-+ aV. This family is the linear interpolations between the 
geometric mean and the arithmetic mean. The representing function of hy is given by 


i> (-eF"4+ 040). 


The case a = 2/3 is called the Heronian mean in the literature. 


7. Applications to operator monotonicity and concavity 


In this section, we generalize the matrix and operator monotonicity and concavity in the 
literature (see e.g. [3, 9]) in such a way that the geometric mean, the harmonic mean or specific 
operator means are replaced by general connections. Recall the following terminology. A 
continuous function f : I —> R is called an operator concave function if 


f(tA + (1—#)B) > tf(A) + (1— t)f(B) 


for any ¢ € [0,1] and Hermitian operators A,B € B(H) whose spectrums are contained in 
the interval J and for all Hilbert spaces 1. A well-known result is that a continuous function 
f : Rt > Ris operator monotone if and only if it is operator concave. Hence, the maps 
t+ ft’ and t ++ logt are operator concave for r € [0,1]. Let H and K be Hilbert spaces. A 
map ® : B(H.) — B(K) is said to be positive if ®(A) > 0 whenever A > 0. It is called unital if 
(I) = I. We say that a positive map ® is strictly positive if ®(A) > 0 when A > 0. A map ¥ 
from a convex subset C of B(H.)*" to B()** is called concave if for each A,B € C andt € [0,1], 


¥ (EA + (1—#)B) > f¥(A) + (1-4) ¥(B). 


A map Y¥ : B(H)*" — B(K)* is called monotone if A < B assures ¥(A) < ¥(B). So, in 
particular, the map A ++ A" is monotone and concave on B(#)t for each r € [0,1]. The map 
A++ log A is monotone and concave on B(H)TT. 


Note first that, from the previous section, the quasi-arithmetic power mean (A,B) +> A#p,« B 
is monotone and concave for any p € [—1,1] and # € [0,1]. In particular, the following are 
monotone and concave: 

(i) any weighed arithmetic mean, 

(ii) any weighed geometric mean, 

(iii) any weighed harmonic mean, 

(iv) the logarithmic mean, 


(v) any weighed power mean of order p € [—1,1]. 


Recall the following lemma from [9]. 


Lemma 0.42 (Choi's inequality). If ® : B(H) — B(K) is linear, strictly positive and unital, then 
for every A > 0, ®(A)“! < ®(A7}). 
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Proposition 0.43. If ® : B(H) — B(K) is linear and strictly positive, then for any A,B > 0 
(A)@(B)~'@(A) < &(AB~'A). (32) 
Proof. For each X € B(H), set ¥(X) = &(A)~1/2@(Al/2XA!/?) (A) 1/2. Then ¥ is a unital 


strictly positive linear map. So, by Choi’s inequality, ¥(A)~! < ¥(A7!) for all A > 0. For 
each A,B > 0, we have by Lemma 0.42 that 


(A)!/2@(B)-1@(A)/? = ¥( AM? BA-V2)-1 
<¥ (aaa ty) 


= &(A)~1/2@(AB-1A)®(A)-1/?. 


So, we have the claim. 


Theorem 0.44. If ® : B(H) — B(K) is a positive linear map which is norm-continuous, then for 
any connection a on B(KC)* and for each A,B > 0, 


(Ag B) < &(A) o (B). (33) 
If, addition, ® is strongly continuous, then (33) holds for any A,B > 0. 
Proof. First, consider A,B > 0. Assume that © is strictly positive. For each X € B(H), set 
W(X) = &(B)-1/2@(BY/2xB!/?) p(B) 1/2. 
Then ¥ is a unital strictly positive linear map. So, by Choi’s inequality, ¥(C)~! < ¥(C~!) for 
all C > 0. For each t € [0,1], put X; = B~1/2(A!; B)B~ 1/2 > 0. We obtain from the previous 
proposition that 
(A!; B) = &(B)!/2¥(xX;)6(B)!/? 

< 0(B)'/?[¥(X;1)] 7B)? 

= &(B)[®(B((1— t)A~! + tB-)B)|-!&(B) 

= ®(B)[(1 — t)®(BA~'B) + t®(B)|~!&(B) 

< ©(B)[(1— £)@(B)®(A)-10(B) + 10(B)]-*(B) 

= (A) |; ®(B). 


For general case of ®, consider the family ®.(A) = ®(A) + eI where e > 0. Since the map 
(A,B) ++ AB = [(1—t)A~! +#B71]~1 is norm-continuous, we arrive at 


(A!;B) < &(A) 1 &(B). 
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For each connection @, since ® is a bounded linear operator, we have 


®(AcB) = Oy A! Bdy(t)) = = ®(A!; B) du(t) 


7 


< i 8A) 4 O(B) du(t) = @(A)o@(B), 
0,1 
Suppose further that ® is strongly continuous. Then, for each A, B > 0, 
®(AcB) = oe +eI)o(B+el)) = pile +eI)o(B+el)) 
& & 


< lim (A + €1) 7 ®(B + el) = ®(A) 0 (8). 
é. 


The proof is complete. 


As a special case, if ® : M,(C) + Mn(C) is a positive linear map, then for any connection 7 
and for any positive semidefinite matrices A,B € Mn(C), we have 


®(AcB) < ®(A)c ®(B). 
In particular, ®(A) #p,. ®(B) < ®(A) #p« P(B) for any p € [—1,1] and a € (0,1). 
Theorem 0.45. If ®,,®2 : B(H)* > B(K)* are concave, then the map 
(Ay, Ap) 4 ©1(Ay) 7 ©9(Ap) (34) 


is concave for any connection 0 on B(K)*. 


Proof. Let Ay, A}, Az, As > Oand t € [0,1]. The concavity of ©; and ®) means that for i = 1,2 
@; (tA; + (1 — t) Aj) > t&;(A;) + (1 — £)®;(A}). 
It follows from the monotonicity and concavity of 7 that 
@1 (tA; + (1—#)A}) 7 Bo(tAz + (1 — #) AS) 


[t® (Ai) + (1 — #)®1(A})] @ [#@2 (Az) + (1 — #)@2(AQ)] 


> 
2 t[@1 (Ay) 7 B2(Az)] + (1 — t)[@1(A1) 7 Bo (Az). 


This shows the concavity of the map (A;, Az) > ®1(A;) 7 ®2(Ap2) . 


In particular, if ©; and 2 are concave, then so is (A,B) ++ ®1(A) #),¢®2(B) for p € [-1,1 
and a € [0,1]. 


Corollary 0.46. Let 7 be a connection. Then, for any operator monotone functions f,g :IRt > Rt, 
the map (A,B) ++ f(A) a g(B) is concave. In particular, 


(1) the map (A,B) ++ A’ a BS is concave on B(H)* for any r,s € [0,1], 
(2) the map (A, B) + (log A) o (log B) is concave on B(H)**. 
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Theorem 0.47. If ®1,®2 : B(H)* > B(K)* are monotone, then the map 

(Ai, Az) 4 (Ay) 7 Bo(Az) (35) 
is monotone for any connection 0 on B(K)*. 
Proof. Let Ay < At and Ag < Ab. Then ®,(A;) < ®;(A}) and ®y(A2) < ®(A}) by 


the monotonicity of ®; and ®). Now, the monotonicity of a forces ®;(A,)7@2(A2) < 
®,(A}) o @(As). 


In particular, if &; and ©2 are monotone, then so is (A, B) ++ ©1(A) #p,.®2(B) for p € [-1,1 
and a € [0,1]. 


Corollary 0.48. Let o be a connection. Then, for any operator monotone functions f,g: Rt > RT, 
the map (A,B) ++ f(A) a g(B) is monotone. In particular, 


(1) the map (A,B) ++ A’ o BS is monotone on B(H)* for any r,s € [0,1], 
(2) the map (A,B) ++ (log A) o (log B) is monotone on B(H) tT. 


Corollary 0.49. Let o be a connection on B(H)*. If ®1,®2 : B(H)* — B(H)* is monotone and 
strongly continuous, then the map 


(A,B) + ®,(A) 7 &2(B) (36) 
is a connection on B(H.)*. Hence, the map 
(A,B) +> f(A) 7 g(B) (37) 


is a connection for any operator monotone functions f,g:IRt + Rt. 


Proof. The monotonicity of this map follows from the previous result. It is easy to see that this 
map Satisfies the transformer inequality. Since ©; and ®2 strongly continuous, this binary 
operation satisfies the (separate or joint) continuity from above. The last statement follows 
from the fact that if An | A, then Sp(An) C [0, ||A1||] for all m and hence f(An) — f(A). 


8. Applications to operator inequalities 


In this section, we apply Kubo-Ando’s theory in order to get simple proofs of many classical 
inequalities in the context of operators. 


Theorem 0.50 (AM-LM-GM-HM inequalities). For A,B > 0, we have 


A!IB< A#B< AABS<AVB. (38) 


Proof. It is easy to see that, for each t > 0,¢ £1, 


eT ee 


1+t “ logt ~ 2 


Now, we apply the order isomorphism which converts inequalities of operator monotone 
functions to inequalities of the associated operator connections. 
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Theorem 0.51 (Weighed AM-GM-HM inequalities). For A,B > 0 and « € [0,1], we have 


AlgB< Atty B< AVgB. (39) 


Proof. Apply the order isomorphism to the following inequalities: 


t 


re ae ee t, £>0. 


The next two theorems are given in [21]. 
Theorem 0.52. For eachi = 1,--- ,n, let A;,B; € B(H)*. Then for each connection 


n n n 


\ (Ajo B;) Le 7 LB. (40) 


i=1 i=1 


Proof. Use the concavity of 7 together with the induction. 


By replacing 7 with appropriate connections, we get some interesting inequalities. 


(1) Cauchy-Schwarz’s inequality: For A;,B; € B(H)**, 


n n n 
\" AP HB? < A? # YY BP. (41) 
i=1 i=1 


i=1 


(2) Hélder’s inequality: For A;, B; € B(H)+ and p,q > O such that 1/p+1/q =1, 
n n nn 
FAP typ BY < So AP ip SB es 
i=1 i=1 i=1 


(3) Minkowski’s inequality: For A;,B; € B(H)*t 


a =i n -1 n “1 
(S04 + a") > (x 4") + (x 5) : a 
i=1 i=1 i=1 


Theorem 0.53. Let A;,B; € B(H)*,i =1,--- ,n, be such that 
Mate SO He BH Bie SG 


Then 


n n n 
A, oB,— )) Ajo Bi > (41-24) o (Za). (44) 
i=2 i=2 i=2 


Proof. Substitute A; to Ay — Az —---— Ay and By to By — By —--- — By in (40). 
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Here are consequences. 
(1) Aczél’s inequality: For A;,B; € B(H)°*, if 
Al—As—---—A2>0 and B?—Bl—.--—B2>0, 
then 
n n n 
At#Bi—Y° A? #B? > | AT— YA?) # ( BP - BB? |. (45) 
i=2 i=2 i=2 


(2) Popoviciu’s inequality: For A;,B; € B(H)*t and p,q > 0 such that 1/p+1/q = 1, if 
p,q > Oare such that 1/p+1/q =1and 


AP-Ap—----A;>0 and Bi—Bi_—..-—Bi>0, 


then 


i=2 


n n 
p q p q p p q q 
Al #1 /p BL — DA} #1 yp Bi > (ai a ba") #1/p (2 - st) . (46) 


(3) Bellman’s inequality: For Aj, B; € B(H)*™, if 


Ady ar As SU and 8 8, eB, Se 
then 
P -1 ‘5 1 a = 
(Ay? + By!) Ll4i By < (4"-Lar) + (a'-Ear) e an 


The mean-theoretic approach can be used to prove the famous Furuta’s inequality as follows. 
We cite [14] for the proof. 


Theorem 0.54 (Furuta’s inequality). For A > B > 0, we have 
(BY APBT)!/4 > Blpt2r)/q (48) 
Alpt2r)/q > (A’BP AT)!/9 (49) 


where r > 0,p >0,q >1and (1+ 2r)q > p+2r. 


Proof. By the continuity argument, assume that A,B > 0. Note that (48) and (49) are 
equivalent. Indeed, if (48) holds, then (49) comes from applying (48) to A’! < Bl and 
taking inverse on both sides. To prove (48), first consider the case 0 < p < 1. We have 
BP+2r — BrBPBY < Br APB" and the Lowner-Heinz’s inequality (LH) implies the desired result. 
Now, consider the case p > 1 and q = (p+ 2r)/(1+ 2r), since (48) for q > (p+ 2r)/(1+ 2r) 
can be obtained by (LH). Let f(t) = t!/7 and let o be the associated connection (in fact, 
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o=#, /q)- Must show that, for any r > 0, 
Bo AP SB. (50) 
ForO<r< i, we have by (LH) that A" > B" and 
BO? g AP > A-* go AP = A-*(1-1/8) aP/d — A> B= B-* o BP. 


Now, set s = 2r+ 5 and q; = (p+2s)/(1+2s) > 1. Let fy(t) = t!/7 and consider the 
associated connection 7. The previous step, the monotonicity and the congruence invariance 
of connections imply that 
B-2s 0} AP = Bie ral (B" APB")|B-" 

> B-T[(B’APB’)-V/n a, (B'APB’)|B* 

= B-"(B’ APB’)!/4B-" 

> BoB! +2rB-T 

= B. 


Note that the above result holds for A,B > 0 via the continuity of a connection. The desired 
equation (50) holds for all r > 0 by repeating this process. 
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Recent Research 
on Jensen’s Inequality for Operators 


Jadranka Mici¢ and Josip Peéari¢ 
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1. Introduction 


The self-adjoint operators on Hilbert spaces with their numerous applications play an 
important part in the operator theory. The bounds research for self-adjoint operators is a very 
useful area of this theory. There is no better inequality in bounds examination than Jensen’s 
inequality. It is an extensively used inequality in various fields of mathematics. 


Let I be a real interval of any type. A continuous function f : I —> R is said to be operator 
convex if 

f Ax + (1—Ajy) < Af (x) + (1 -A)FY) (1) 
holds for each A € [0,1] and every pair of self-adjoint operators x and y (acting) on an infinite 
dimensional Hilbert space H with spectra in I (the ordering is defined by setting x < yify—x 
is positive semi-definite). 


Let f be an operator convex function defined on an interval I. Ch. Davis [1] proved! a Schwarz 


inequality 

Ff (p(x) So (F(x) (2) 
where ~: A — B(K) is a unital completely positive linear mapping from a C*-algebra A to 
linear operators on a Hilbert space K, and x is a self-adjoint element in A with spectrum in I. 
Subsequently M. D. Choi [2] noted that it is enough to assume that ¢ is unital and positive. 
In fact, the restriction of ¢ to the commutative C*-algebra generated by x is automatically 
completely positive by a theorem of Stinespring. 


F. Hansen and G. K. Pedersen [3] proved a Jensen type inequality 


f (= se) < Yat f(x)q; (3) 
i=1 i=1 


| There is small typo in the proof. Davis states that ¢ by Stinespring’s theorem can be written on the form ¢(x) = Pp(x)P 
where p is a *-homomorphism to B(H) and P is a projection on H. In fact, H may be embedded in a Hilbert space K on 
which p and P acts. The theorem then follows by the calculation f(¢(x)) = f(Pp(x)P) < Pf(p(x))P = Pp(f(x)P = 
$(f (x)), where the pinching inequality, proved by Davis in the same paper, is applied. 


190 Linear Algebra — Theorems and Applications 


for operator convex functions f defined on an interval I = [0,a) (with a < o and f(0) < 
0) and self-adjoint operators x1,...,%n with spectra in I assuming that )7_, afa; = 1. The 
restriction on the interval and the requirement f(0) < 0 was subsequently removed by B. 
Mond and J. Peéari¢ in [4], cf. also [5]. 


The inequality (3) is in fact just a reformulation of (2) although this was not noticed at the 
time. It is nevertheless important to note that the proof given in [3] and thus the statement 
of the theorem, when restricted to n x n matrices, holds for the much richer class of 2 x 
2n matrix convex functions. Hansen and Pedersen used (3) to obtain elementary operations 
on functions, which leave invariant the class of operator monotone functions. These results 
then served as the basis for a new proof of Lowner’s theorem applying convexity theory and 
Krein-Milman’s theorem. 


B. Mond and J. Peéari¢ [6] proved the inequality 
f (s wps)) < Li widi( f(xi)) (4) 


for operator convex functions f defined on an interval I, where ¢; : B(H) — B(K) are unital 
positive linear mappings, x1,...,Xy are self-adjoint operators with spectra in I and w ,...,Wn 
are are non-negative real numbers with sum one. 


Also, B. Mond, J. Peéari¢, T. Furuta et al. [6-11] observed conversed of some special case 
of Jensen’s inequality. So in [10] presented the following generalized converse of a Schwarz 
inequality (2) 


F[p (F(A), ((A))] S_max F | f(m) 4 Aer aTe 


,o(t)| la 5 
By m),g(t)| ln (5) 
for convex functions f defined on an interval [m,M], m < M, where g is a real valued 
continuous function on [m,M], F(u,v) is a real valued function defined on U x V, matrix 
non-decreasing in u, U D f[m,M], V D g{m,M], ¢ : Hy — Hg is a unital positive linear 
mapping and A is a Hermitian matrix with spectrum contained in [m, M]. 


There are a lot of new research on the classical Jensen inequality (4) and its reverse inequalities. 
For example, J.I. Fujii et all. in [12, 13] expressed these inequalities by externally dividing 
points. 


2. Classic results 


In this section we present a form of Jensen’s inequality which contains (2), (3) and (4) as special 
cases. Since the inequality in (4) was the motivating step for obtaining converses of Jensen’s 
inequality using the so-called Mond-Peéari¢ method, we also give some results pertaining to 
converse inequalities in the new formulation. 


We recall some definitions. Let T be a locally compact Hausdorff space and let A be a 
C*-algebra of operators on some Hilbert space H. We say that a field (x;);<7 of operators 
in A is continuous if the function f ++ x; is norm continuous on T. If in addition p is a Radon 
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measure on T and the function t ++ ||x;|| is integrable, then we can form the Bochner integral 
J xt 4u(t), which is the unique element in A such that 


o( fant) = fo) dnl 


for every linear functional ¢ in the norm dual A*. 


Assume furthermore that there is a field (¢;);<7 of positive linear mappings ¢ : A > B 
from A to another C*-algebra 6 of operators on a Hilbert space K. We recall that a linear 
mapping ¢; : A — B is said to be a positive mapping if ¢;:(x;) > 0 for all x; > 0. We say 
that such a field is continuous if the function t ++ ¢;(x) is continuous for every x € A. Let 
the C*-algebras include the identity operators and the function t ++ ¢;(1) be integrable with 
Jr ¢t(1H) du(t) = k1x for some positive scalar k. Specially, if [7 ~+(1H) du(t) = 1x, we say 
that a field (#t)te7 is unital. 


Let B(H) be the C*-algebra of all bounded linear operators on a Hilbert space H. We define 
bounds of an operator x € B(H) by 


my = inf (x@,) and My = sup (xé,2) (6) 
\gll=1 \gl|=1 


for ¢ € H. If Sp(x) denotes the spectrum of x, then Sp(x) C [myx, My]. 


For an operator x € B(H) we define operators |x|, xt, x~ by 


lx] = (xtx)2, xt = (|x| +x)/2, 9 x7 = (|x| —x)/2 


1/2 


Obviously, if x is self-adjoint, then |x| = (x*)!/* and x+,x7 > 0 (called positive and negative 


parts of x = xt — x7). 


2.1. Jensen’s inequality with operator convexity 


Firstly, we give a general formulation of Jensen’s operator inequality for a unital field of 
positive linear mappings (see [14]). 


Theorem 1. Let f : I — R be an operator convex function defined on an interval I and let A and 
B be unital C*-algebras acting on a Hilbert space H and K respectively. If (ft)ter is a unital field of 
positive linear mappings ¢ : A — B defined on a locally compact Hausdorff space T with a bounded 
Radon measure 1, then the inequality 


F(fgntond ante) < [ocFln) dn ”) 


holds for every bounded continuous field (x+);<7 of self-adjoint elements in A with spectra contained 
in I. 


Proof. We first note that the function t +> @;(x:) € B is continuous and bounded, hence 
integrable with respect to the bounded Radon measure p. Furthermore, the integral is an 
element in the multiplier algebra M(B) acting on K. We may organize the set CB(T,A) of 
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bounded continuous functions on T with values in A as a normed involutive algebra by 
applying the point-wise operations and setting 


Il(yt)terll = sup lvell = (Ye)ter € CB(T, A) 
€ 


and it is not difficult to verify that the norm is already complete and satisfy the C*-identity. 
In fact, this is a standard construction in C*-algebra theory. It follows that f((xt)ter) = 
(f (xt) )ter- We then consider the mapping 


7: CB(T, A) -> M(B) C B(K) 


defined by setting 
r((x)rer) =f, ge(xs) dq(t) 


and note that it is a unital positive linear map. Setting x = (x1)rer € CB(T,A), we use 
inequality (2) to obtain 


f(t ((x)rer)) = F(X) < AUF) = 1 (F((xer)) = 7 (FC) ser) 


but this is just the statement of the theorem. 


2.2. Converses of Jensen’s inequality 


In the present context we may obtain results of the Li-Mathias type cf. [15, Chapter 3] and 
[16, 17]. 


Theorem 2. Let T be a locally compact Hausdorff space equipped with a bounded Radon measure 
yu. Let (xt)ter be a bounded continuous field of self-adjoint elements in a unital C*-algebra A with 
spectra in [m, M],m < M. Furthermore, let (@1)ter be a field of positive linear mappings g; : A > 
B from A to another unital C*—algebra B, such that the function t ++ ¢4(1y) is integrable with 
Jr +(x) du(t) = k1x for some positive scalar k. Let my and My, my < Mx, be the bounds of the 
self-adjoint operator x = [> p¢(xz) du(t) and f : [m,M] > R,g: [mx,Mx] 7 R,F:UxV>OR 
be functions such that (kf) ([m,M]) C U, g([mx,Mx]) C V and F is bounded. If F is operator 
monotone in the first variable, then 


vag FL in (G2 ye g(z 2) te sF | f aC (xr)) du(t) is (fo xp) dy w)| 


< sup F [kn (;2) s(2) 1x ei 


My <Z<My 
holds for every operator convex function hy on [m, M] such that hy < f and for every operator concave 


function hz on [m, M] such that hz > f. 


Proof. We prove only RHS of (8). Let hy be operator concave function on [m,M] such that 
f(z) < hp(z) for every z € [m, M]. By using the functional calculus, it follows that f(x) < 
ho (xp) for every t € T. Applying the positive linear mappings ¢; and integrating, we obtain 


[or f(xt)) du(t) < [ml ha(xt)) du(t) 
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Furthermore, replacing ¢; by rot in Theorem 1, we obtain t : ft (ho (xt)) du(t) < 
1 Seis 1 : 
hy € [me ano), which gives |. (f(xt)) du(t) < k- hg (= [mxoan), Since 


mx 1K < fr pe(xe)du(t) < Mx 1x, then using operator monotonicity of F(-,7) we obtain 


F| f o(rea)antt)s (J, aeoxndute) 0) 
<F [ke in; [oe x¢) du(t )) 8 (f Pe(xp)dp( »)| < aoe [ka @ i3(2) 1x 


Applying RHS of (8) for a convex function f (or LHS of (8) for a concave function f) we obtain 
the following generalization of (5). 


Theorem 3. Let (xt)ter, Mx, Mx and ($¢)ter be as in Theorem 2. Let f : [m,M] > R,g: 
[mx, Mx] + R, F: Ux V > R be functions such that (kf) ({m,M]) C U, ¢({mx,Mx]) C V and 
F is bounded. If F is operator monotone in the first variable and f is convex on the interval [m, M], 


_ FL [me oe (F(x+)) du(t) is (fo Xr )dy( »)| 


Mk — z z—km (10) 
| |r 


< sup F 


My <Z< My 


Fat (m) + FM), g(2) 


In the dual case (when f is concave) the opposite inequalities hold in (10) with inf instead of sup. 


Proof. We prove only the convex case. For convex f the inequality f(z) < =%f(m)+ 
i725 f(M) holds for every z € [m,M]. Thus, by putting My(z) = 4=2f(m) + AS f(M 
in (9) we obtain (10). 


Numerous applications of the previous theorem can be given (see [15]). Applying Theorem 3 
for the function F(u,v) = u— av and k = 1, we obtain the following generalization of [15, 
Theorem 2.4]. 


Corollary 4. Let (xt):e7, Mx, Mx be as in Theorem 2 and ($¢);c7 be a unital field of positive linear 
mappings g¢, : A — B. If f : [m,M] — R is convex on the interval [m,M], m < M, and g: 
[m, M] — R, then for anya € R 


[om ron)ante) sag (f glxoau(e)) + C18 ay 
where 
C= max | call) + 5p (M) —ag(2)} 


< max, {= fm) + Z—™ f(a) —ag(e)} 


m<z<M 
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If furthermore wg is strictly convex differentiable, then the constant C = C(m,M,f,g,«) can be 
written more precisely as 


C= 2 F(m) 4" FM) — ag(z0) 
where 
gf (LAME L OM) iF ag! (mg) < ALO < ae!) 
z0 = { my if ag! (my) > Lf 
My if, og!(My) < Lape fow) 


In the dual case (when f is concave and «xg is strictly concave differentiable) the opposite inequalities 
hold in (11) with min instead of max with the opposite condition while determining Zz . 


3. Inequalities with conditions on spectra 


In this section we present Jensens’s operator inequality for real valued continuous convex 
functions with conditions on the spectra of the operators. A discrete version of this result is 
given in [18]. Also, we obtain generalized converses of Jensen’s inequality under the same 
conditions. 


Operator convexity plays an essential role in (2). In fact, the inequality (2) will be false if 
we replace an operator convex function by a general convex function. For example, M.D. 
Choi in [2, Remark 2.6] considered the function f(t) = t* which is convex but not operator 
convex. He demonstrated that it is sufficient to put dimH = 3, so we have the matrix case as 
follows. Let ® : M3(C) — M2(C) be the contraction mapping ®((4j)1<i,j<3) = (aij)1<i,j<2- If 


101 
A= [001], then ®(A)* = 10 L on = ®(A*) and no relation between ®(A)* and 
111 00 53 
®(A*) under the operator order. 


Example 5. It appears that the inequality (7) will be false if we replace the operator convex function 
by a general convex function. We give a small example for the matrix cases and T = {1,2}. We 
define mappings ®,,®2 : M3(C) — M2(C) by ®1((ajj)1<i,'<3) = 5 (di 1<ij<2, D>) = D,. Then 


®1 (13) + ®2 (13) = h. 
101 100 
X,=2{/001) and X,=2{(000 
111 000 


DT) If 
4 (160 80 40\ _ 4 4 
(1 (X1) + Bo(X2))" = € ;) & & 4) =O; (xt) + By (x3) 
Given the above, there is no relation between (®1(X1) + ®(X2))* and ®, (Xt) + Bo (XF 
under the operator order. We observe that in the above case the following stands X = ®(X,) + 
dere € :) and (mz,Mx] = [0,2], [11,4] C [—1.60388, 4.49396], [r2, Mo] = [0,2], 


1.€. 


then 


(my, Mx) C [my, My] U [m2, Mp] 
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(see Fig. 1.a). 


-@--@-2----~+-e----2e-- --@-- @----- @-e----- 
m Mm Mm M, M, M, mM, M& M, M, M, 
a) b) 


Figure 1. Spectral conditions for a convex function f 


-140 1 150 0 
xy= Q =2=1 and Xx,={020 
1 -1-1 0015 


cox) entrant = (9) < (2-20) = 9 (xt) +2 (4) 
So we have that an inequality of type (7) now is valid. In the above case the following stands 
X = ;(X}) +.©(X>) = (3 :) and [mx, My] = [0,0.5], [r1, My] C [-14.077, 0.328566], 
[m2, Mp2] = [2,15], ie. 
(my,Mx)  [my,My] =@ and (my,Mx) AO [m2,M2] = © 


ID) If 


then 


(see Fig. 1.b). 


3.1. Jensen’s inequality without operator convexity 


It is no coincidence that the inequality (7) is valid in Example 18-II). In the following theorem 
we prove a general result when Jensen’s operator inequality (7) holds for convex functions. 


Theorem 6. Let (x;)+e7 be a bounded continuous field of self-adjoint elements in a unital C*-algebra 
A defined on a locally compact Hausdorff space T equipped with a bounded Radon measure y.. Let my 
and Mt, my < Mt, be the bounds of x, t € T. Let (#t)ter be a unital field of positive linear mappings 
gt: A > B from A to another unital C* —algebra B. If 


(mx, Mx) N (me, Mz] =, teT 


where my and My, my < Mx, are the bounds of the self-adjoint operator x = St f(x¢) du(t), then 


F (fgets) autt)) < f go(ptae)) aye (12 


holds for every continuous convex function f : I + IR provided that the interval I contains all m;, M,. 


If f : I Ris concave, then the reverse inequality is valid in (12). 
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Proof. We prove only the case when f is a convex function. If we denote m = inf {ms} 
€ 
and M = sup{M;}, then [m,M] C I and mly < A; < Mly, t € T. It follows 
teT 


m1 < fr Ge(xt) du(t) < M1x. Therefore [my,Mx] C [m, M] CI. 
a) Let my < My. Since f is convex on [mx, Mx], then 


fs 7a 


f (myx) M, — az) Me), ZE [mx, Mx] (13) 


x — Mx 


——__—* (My), z€([m,M;], teT (14) 


Since mylx < f. 7 fi (xt) du(t) < My1x, then by using functional calculus, it follows from (13) 


F (feta) du(ey) < MKT rH) 4, RPA tle 


On the other hand, since m;ly < x; < Mily, t € T, then by using functional calculus, it 
follows from (14) 


f(Mx) (15) 


_ xXp—mMy1 
(2) We ee eT 


Applying a positive linear mapping ¢; and summing, we obtain 


[o (f(x) du(t) > Mx1x Fr) HO) Fn, | fri a) ~ mslk ay.) (16) 


since [7 p¢(1H) du(t) = 1x. Combining the two inequalities (15) and (16), we have the desired 
inequality (12). 


b) Let m, = My. Since f is convex on [m, M], we have 
f(z) > f(my) + 1(my)(z— my) for every z € [m, M] (17) 


where / is the subdifferential of f. Since mly < x; < M1y,¢t € T, then by using functional 
calculus, applying a positive linear mapping ¢; and summing, we obtain from (17) 


[ (FC) a(t) > Fm) + ms) ( ff eC) dele) — mer} 
Since my1x = fr $r(xr) du(t), it follows 


[oe (pla) duit) > flonsyte =F (foros) ante) 


which is the desired inequality (12). 


Putting ¢:(y) = ary for every y € A, where a; > 0 is a real number, we obtain the following 
obvious corollary of Theorem 6. 
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Corollary 7. Let (x+)+er be a bounded continuous field of self-adjoint elements in a unital C*-algebra 
A defined on a locally compact Hausdorff space T equipped with a bounded Radon measure yl. Let mt 
and Mz, mp < Mt, be the bounds of xt, t € T. Let (at)ter be a continuous field of nonnegative real 
numbers such that f-, a; dy(t) = 1. If 


(mx, Mx) a [me, Mz] = ©, teT 
where my and My, my < Mx, are the bounds of the self-adjoint operator x = f-, a,x; du(t), then 


F(faxeaut)) < f afta) ancr a) 


holds for every continuous convex function f : I + R provided that the interval I contains all m;, M;. 


3.2. Converses of Jensen’s inequality with conditions on spectra 
Using the condition on spectra we obtain the following extension of Theorem 3. 


Theorem 8. Let (x+)+er be a bounded continuous field of self-adjoint elements in a unital C*-algebra 

A defined on a locally compact Hausdorff space T equipped with a bounded Radon measure yp. 

Furthermore, let (@¢)1er be a field of positive linear mappings ¢; : A > B from A to another unital 

C*—algebra B, such that the function t + ; (14) is integrable with J ¢+(1H) du(t) = k1x for some 

positive scalar k. Let mand M;, mz < Mt, be the bounds of x;,t € T,m = inf {ms}, M= Sup Mey, 
€ 


and my and Mx, my < Mx, be the bounds of x = J- p+ (xt) du(t). If 

(mx, Mx)  [mt, Mt] = ©, teT 
and f : [m,M] > BR, g: [mx,M,] > R, F: Ux V > Rare functions such that (kf) ({m,M]) C 
U, ¢ ([mx,Mx]) C V, f is convex, F is bounded and operator monotone in the first variable, then 


. Myxk —Z rs km 
amy, F Fe K—? Fm) + = “ F(Mx), (2) iy 


| f o(ron)antt).s (f aeoxndute ) 
< sup FFE fm) + FM), g(2)| 1x 


My <Z< My 


In the dual case (when f is concave) the opposite inequalities hold in (19) by replacing inf and sup with 
sup and inf, respectively. 


Proof. We prove only LHS of (19). It follows from (14) (compare it to (16)) 


[oe (a4) du(t) > Myk1x ree du(t ) en oe Srp — My FAK 54.) 


since [7 ~(1H) du(t) = k1x. By using operator monotonicity of F(-,v) we obtain 


Fl [ot ge (F(xt)) du(t) is (fo xt) du(t )] 


> F | Mette feta) rim) + Wom f (Ma). 8 (Sr dea) an(t))| 
Myk — —kmx 
> inf F Fe = — f (my) My = f(Ms),s(2) 1k 


my<SZ<My 
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1/2 


Putting F(u,v) = u—av or F(u,v) = v !/2uv~ 1/2 in Theorem 8, we obtain the next corollary. 


Corollary 9. Let (x¢)rer, mt, Mt, mx, Mx, m, M, (ft)ter be as in Theorem 8 and f : |m,M] > R, 
g: [mx,Mx] — R be continuous functions. If 


(my, Mx) N (me, Mz] =, teT 


and f is convex, then for anya € R 


pity, {Ree flins) + Gn fMs) — 9(2) bx +g ( J ooxdeu(t) 
< [pe (flas)) adult) (20) 
<ag( J d(sdau(t)) +, max, {FE s(n) + Fe s(M) — s(@)} 14 


If additionally g > 0 on [m x, My], then 


My,k—z my) 4 z—km, My 
min Ma—nieS (Ms) + ie=mef( Ns I. ou(xdau(t)) 
k 


My <z<My g(z) 


Mk—z m z—km 
< fm rcordnt ogy, { BL ROO 6 acanay) 


My <Z<My 


(21) 


In the dual case (when f is concave) the opposite inequalities hold in (20) by replacing min and max 
with max and min, respectively. If additionally g > 0 on [mx, Mx], then the opposite inequalities also 
hold in (21) by replacing min and max with max and min, respectively. 


4. Refined Jensen’s inequality 


In this section we present a refinement of Jensen’s inequality for real valued continuous 
convex functions given in Theorem 6. A discrete version of this result is given in [19]. 


To obtain our result we need the following two lemmas. 


Lemma 10. Let f be a convex function on an interval I, m,M € I and py,p2 € [0,1] such that 
Pi + p2 = 1. Then 


min{ pr pa} | fm) +f) ~2f (“E™*)] < past) + pof(M)— f(rm+ pM) (2) 


Proof. These results follows from [20, Theorem 1, p. 717]. 


Lemma 11. Let x be a bounded self-adjoint elements in a unital C*-algebra A of operators on some 
Hilbert space H. If the spectrum of x is in |m, M], for some scalars m < M, then 


Miyp-x x—mly 


f(x) < MAH gem) 4 2H pay — 0/2 3) 
(resp. f (x) > MAH (my + XMM pay + 0/8) 
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holds for every continuous convex (resp. concave) function f : [m,M] + R, where 


5p = fm) + f(M) —2f (5M) (resp. b = 2f (MM) — Fm) — F(M)) 


xw—1y._ 1 
and X= 51H — yn 


x mM 1n| 


Proof. We prove only the convex case. It follows from (22) that 


f (pim + poM) < pif(m) + pof(M) 


(24) 
— min{ py, po} (f(m) + f(M) — 2F (™5™)) 
for every p1, p2 € [0,1] such that pj + po = 1. For any z € [m, M] we can write 
M— = 
fa)= 5 (2m + 2m) 
Then by using (24) for py = 4&2 and p) = 42" we obtain 
fle) < sm) + 2 fm) 
1 1 m+M m+M (25) 
Ca EE (emveran-sy(25%) 


since 


Finally we use the continuous functional calculus for a self-adjoint operator x: f,g € 
C(1),Sp(x) C Iand f < g on I implies f(x) < g(x); and h(z) = |z| implies h(x) = |x|. 
Then by using (25) we obtain the desired inequality (23). 


Theorem 12. Let (x;)e7 be a bounded continuous field of self-adjoint elements in a unital C*-algebra 
A defined on a locally compact Hausdorff space T equipped with a bounded Radon measure y. Let mt 
and My, my < Mt, be the bounds of x, t € T. Let (ft)ter be a unital field of positive linear mappings 
gt: A B from A to another unital C*—algebra B. Let 


(mx,Mx) [me Mik] = @, ft ET, and m<M 


where my and My, my < Mx, be the bounds of the operator x = fr $e(x¢) du(t) and 


m = sup {M;: Mi < my,t € T}, M = inf {m;: mp > Mx,t € T} 


If f : 1 + Risa continuous convex (resp. concave) function provided that the interval I contains all 
mt, M;, then 


F(fotaan) < [mre anly oes [osen)an) —— @6) 
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(resp. 
F(forcan)an(t)) = forced) dult) 52> fof) aul) 2”) 


holds, where 


by = df(t, M1) = fm) + f(M) — 2F (™5™) 
(resp. 57 = 5+ (mi, M) = 2F (5M) — Fm) — f(M) ) (28) 
¥ = X,(m,M) = 51x - ae |: nM | 


and m € [m,ma], M € [My,M], m < M, are arbitrary numbers. 


Proof. We prove only the convex case. Since x = f7:(x1) du(t) € B is the self-adjoint 
elements such that 71x < mylx < fr or(xr)du(t) < Mxlx < M1x and f is convex on 
[m, M] C I, then by Lemma 11 we obtain 


( l ies in) Z Sa a pon) 4 Le) HO = MK pry _ 5, 29) 


—m M-m 


where 6 f and x are defined by (28). 


But since f is convex on [m;, M;] and (mx, Mx) [mt, Mi] = © implies (1, M) 2M [m:, Me] = D, 
then - 
M1y x} — M14 


f (x1) > So em) + (M), eT 


Applying a positive linear mapping ¢;, integrating and adding —6 fx, we obtain 


«Mix — frpe(ae) du(t) | Sr Pe(ae) du(t) — mK, o 

> = { = 
[de (Fld) au(t) — 0¢8 > TPE SES (mm) oe  () 2 
since [7 p:(1H) du(t) = 1x. Combining the two inequalities (29) and (30), we have LHS of 
(26). Since 5 720 and x > 0, then we have RHS of (26). 


If m < Mand m, = My, then the inequality (26) holds, but bf (Mx, Mx) x X(mx, Mx) is not 
defined (see Example 13 I) and II)). 


Example 13. We give examples for the matrix cases and T = {1,2}. Then we have refined inequalities 
given in Fig. 2. We put f(t) = t* which is convex but not operator convex in (26). Also, we define 
mappings ®,,®2 : M3(C) — Mp(C) as follows: ®1 ((ajj)1<i,j<3) = 5 (Ai )1<ij<2, Dz = ®, (then 
®(J3) + ®2(I3) = h). 


I) First, we observe an example when 6 ip X is equal to the difference RHS and LHS of Jensen’s 
inequality. If X1 = —3I3 and Xy = 21s, then X = ®1(X1) + ®2(X2) = —0.5hb, som = —3, 
M = 2. Wealso put m = —3 and M = 2. We obtain 


(®1(X1) + ©2(X2))* = 0.0625n < 48.5 = © (xt) 4-Dy (x4) 
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F (,(X;)) + (o(X2)) S o,(F (X;)) + ba(F (X2))- 5X, 
where 
5, =f (m)+f(M)—2f (M+m)/2) 


M+m 
2 


: x1, __ 1 = 
ws a : Rank wm P1%) + 2(%2) Ik 


lll > ——— “-—- - 
m, M=m =m, M, =m, M, 


Figure 2. Refinement for two operators and a convex function f 


and its improvement 


(®,(X1) + &o(X))* = 0.0625 = O, (xt) +, (x3) — 48.4375 Ip 


since of = 96.875, X = 0.5Ib. We remark that in this case my = My = —1/2 and X(my, Mx) is not 
defined. 


II) Next, we observe an example when 6 xX is not equal to the difference RHS and LHS of Jensen’s 
inequality and my = Mx. If 


-10 0 200 ae 
X,= {0-2 0 |, X»=]030 then x= 3 (58) andm=-—1, M=2 
0 0 -1 004 


In this case x(mx, Mx) is not defined, since my = My = 1/2. We have 


(®1(X1) + ©2(X2))* = x € i) < G ) = 9) (xt) + 2 (x3) 


and putting m = —1, M = 2 we obtain df = 135/8, X = Ih/2 which give the following improvement 


(®,(X1) + ©2(X2))* = a ic :) < - € an) =e (xi) +22 (x2) 7 iE ¢ ') 


U1) Next, we observe an example with matrices that are not special. If 


-411 5 —1=1 1/10 
X,={1 -2-1 and X»={-12 1], then X= 5 € i) 
1-1-1 -11 3 


som, = —4.8662, My = —0.3446, mz = 1.3446, Mz = 5.8662, m = —0.3446, M = 1.3446 and 
we put m =m, M = M (rounded to four decimal places). We have 


1 (10 1283 255 
(®1(X1) + ®2(Xp))* = 16 (( 5) = es 237 ) = (x7) + 2 (x2) 
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and its improvement 
4 1/10 
(®1(X1) + 2(X2))" = FE (og 


639.9213 -255 \_ (v4 4) (15787 0 
. ( 255 eee) = % (Xf) +@ (x2) - ( 0 ogi) 


05 0 
0 0.2040 


M = Myx = 0.5, then X= 0, so we do not have an improvement of Jensen's inequality. Also, if we put 


m =0,M =1, then X =05 G i) of = 7/8 and 5¢X = 0.4375 (( Ht) which is worse than the 


(rounded to four decimal places), since of = 3.1574, X = ( ), But, if we put m = my = 0, 


01 01 
above improvement. 


Putting ®;(y) = ay for every y € A, where a; > 0 is a real number, we obtain the following 
obvious corollary of Theorem 12. 


Corollary 14. Let (x;):e7 be a bounded continuous field of self-adjoint elements in a unital C*-algebra 
A defined on a locally compact Hausdorff space T equipped with a bounded Radon measure y.. Let mt 
and Mz, my < Mt, be the bounds of xz, t € T. Let (at)ter be a continuous field of nonnegative real 
numbers such that | a+ du(t) = 1. Let 


(mx,Mx) [me Mik] =O, t ET, and m<M 
where my and My, my < My, are the bounds of the operator x = te ge(xp) du(t) and 
m= sup {M:i: My < my,t € T}, M = inf {m: my > My,t € T} 


If f : 1 + Risa continuous convex (resp. concave) function provided that the interval I contains all 
mt, M;, then 


f( fommedn(t)) < Spars) du(t) — 898 < frarf(as) adult) 


(resp. f( [ mxedu(t)) > Sraeflxe) du(t) + 5)8 > fr-acf(as) du(t) ) 


holds, where 5¢ is defined by (28), ¥ = 51H — z= Sr ayx; du(t) — 24M 1n| and m € |m,ma], 
M € [Ma,M\], m < M, are arbitrary numbers. 


5. Extension Jensen’s inequality 


In this section we present an extension of Jensen’s operator inequality for n—tuples of 
self-adjoint operators, unital n—tuples of positive linear mappings and real valued continuous 
convex functions with conditions on the spectra of the operators. 

In a discrete version of Theorem 6 we prove that Jensen’s operator inequality holds for every 
continuous convex function and for every n—tuple of self-adjoint operators (Aj,...,An), 
for every n—tuple of positive linear mappings (®1,...,®,) in the case when the interval 
with bounds of the operator A = )"!_, ®;(A;) has no intersection points with the interval 
with bounds of the operator A; for each i = 1,...,n, ie. when (my4,M,4)M [m;,M;j| = © 
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fori = 1,...,n, where ma and My, ma < Mg, are the bounds of A, and m; and M,, 
m; < Mj, are the bounds of A;,i = 1,...,n. It is interesting to consider the case when 
(ma,Ma,)O [m;, Mj] = @ is valid for several i € {1,...,n}, but not for alli = 1,...,n. We 
study it in the following theorem (see [21)]). 

Theorem 15. Let (A1,...,An) be an n—tuple of self-adjoint operators A; € B(H) with the bounds 
m, and M,,m; < Mj,i=1,...,n. Let (®1,...,®,) be an n—tuple of positive linear mappings ®; : 
B(H) — B(K), such that ?_, ®j(14) = 1x. For 1 < ny <n, we denote m = min{my,...,Mn, }, 
M = max{Mj,...,Mn,} and D2, (1H) = @1x, Vey Pi(1H) = 61x, where «,B > 0, 
a+B=1If 

(m, M) 1 [m;, M;] = ©, i=n,+1,...,n 


and one of two equalities 


is valid, then 


1S @(f(A)) < Lea) <4 YS o(F(A) G1) 
i=1 i 


holds for every continuous convex function f : I + IR provided that the interval I contains all mj, Mj, 
i=1,...,n. If f : 1 + Ris concave, then the reverse inequality is valid in (31). 


Proof. We prove only the case when f is a convex function. Let us denote 


1 ny al n n 
A=—J)° (Aj), B=—e es ®;(Aj), C=) 9;(A) 
* fA B i=n4+1 i=1 


It is easy to verify that A = B or B= Cor A=Cimplies A= B=C. 
a) Let m < M. Since f is convex on [m, M] and [m;, Mj] C [m, M] fori = 1,...,n1, then 


f2< a fm) ! aa af (M), z€[m;,M,] fori=1,...,m (32) 


but since f is convex on all [m;, M;] and (m, M) N [m;, Mj] = @ fori = n, +1,...,n, then 


f(z) > a (mn) a f(M), z€ ([m;,M,;] fori=m +1,...,n (33) 


—m 


Since mjlpy < Aj < Mjly,i=1,...,m, it follows from (32) 


M1y — A; A; —m1 : 
f (Ai) $ Rt fm) + SBM), i= 1m 


Applying a positive linear mapping ©; and summing, we obtain 


Fe oy (f(A) < MEE Eee SHAD gy 4, Eis SHAD = Oe oy 
i=1 
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since 7, ®j(14) = #1. It follows 


12 
pe AS ee, OO ey OO (34) 


Similarly to (34) in the case mjly < Aj < Mjly,i =n, +1,...,n, it follows from (33) 


5 BFAD) = SE slm) + EM) (35) 


B i=n4+1 


Combining (34) and (35) and taking into account that A = B, we obtain 


TYAN ss Ve esa) (36) 
i=1 i=ny+1 
It follows 
1 ¥* @,(f(A)) = VN O(F(A)) +2 (F(A) (by «+B =1) 
i=1 i=1 i=1 
< V@(f(A)) + Yo &,(f(A)) (by (36)) 
i=1 i=n+1 
= Lefty) 
<5 @:(F(A)) + J O:(f(Ai)) (by (36)) 
i=m+1 i=n+1 
=5  &;(F(A;)) (by «+8 =1) 
i=n+1 


which gives the desired double inequality (31). 
b) Let m = M. Since [m;, Mj] C [m, M] fori =1,...,1,, then A; = m1q and f(A;) = f(m)1y 
fori =1,...,ny. It follows 


| 4 1 4 
= 5 0(Aj) = mlx and ; Ye 9; (f(Ai)) = f(m)1k (37) 
i=1 i=1 
On the other hand, since f is convex on I, we have 
f(z) > f(m) +1(m)(z—m) for every z € I (38) 


where | is the subdifferential of f. Replacing z by A; fori = n, +1,...,n, applying ®; and 
summing, we obtain from (38) and (37) 


; 5 0, (f(A) = fone +10) ( 5 9,(4,)— mx) 


i=n+1 


So (36) holds again. The remaining part of the proof is the same as in the case a). 


Recent Research on Jensen's Inequality for Operators 205 


Remark 16. We obtain the equivalent inequality to the one in Theorem 15 in the case when 
LL, ®i(1H) = 71k, for some positive scalar . If « + B = y and one of two equalities 


1 ny 1 n 1 n 
=} ' (A; => @;(A;) = — J) OA 
roe (Ai) Bi, (Aj) 7h (Ai) 
is valid, then 
FL Oi(F(Ad) <> Le(FAD) Se YL PUfAD) 
i=t Via B sad 


holds for every continuous convex function f. 


Remark 17. Let the assumptions of Theorem 15 be valid. 
1. We observe that the following inequality 


1 nN 1 n 
(3 > 94) S3 DL ®(f(Ai)) 
i=n+1 i=n+1 
holds for every continuous convex function f : I > R. 
Indeed, by the assumptions of Theorem 15 we have 
Ny 1 ny 1 n 
maly < YP i(f(Ai)) < Maly and — (Aj) = B Le (Aj) 
i=1 i=1 i 
which implies 


n 
1 
mly< )) 3%((f(Ai)) < M1x 

i=n+1 B 
Also (m, M) 9 [m;, Mj] = @ fori = 11 +1,...,n and Vien 44 pi (Li) = 1x hold. So we can 
apply Theorem 6 on operators Ay, +1,..-, An and mappings poi and obtain the desired inequality. 
2. We denote by mc and Mc the bounds of C = Yo_, ®;(Aj). If (mtc,Mc) N [m;,Mj] = @, 
i =1,...,m or f is an operator convex function on [m, M], then the double inequality (31) can be 
extended from the left side if we use Jensen's operator inequality (see [16, Theorem 2.1]) 


(Zea =f (2 e409) 


i=] 


Example 18. If neither assumptions (mc,Mc) [m;,M;| = @,i = 1,...,m1, nor f is operator 
convex in Remark 17 - 2. is satisfied and if 1 < n, <n, then (31) can not be extended by Jensen's 
operator inequality, since it is not valid. Indeed, for ny = 2 we define mappings ®1, Pz : M3(C) > 
Mp(C) by ®1((aij)1<i,7<3) = § (aij )1<i,j<2, Pz = P1. Then Oy (Iz) + ®2 (13) = ah. If 


‘ba 100 
A,=21001] and Ay=21000 


111 000 
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then 
1 1 * 1 /160\ .1/8040\ 1 a, 1 j 
(u(y " x(2)) ~ & ) ar & 5a) = 201 (Ai) + 5% (4%) 


for every « € (0,1). We observe that f(t) = t* is not operator convex and (mc, Mc) (mi, Mi] 4 
20 

@, since C = A = 1,(A;)+44)(A2) = } € o [mc,Mc] = [0,2/a], [m,My] < 

[—1.60388, 4.49396] and [m2, Mp] = [0,2]. 


With respect to Remark 16, we obtain the following obvious corollary of Theorem 15. 


Corollary 19. Let (A1,...,An) be an n—tuple of self-adjoint operators A; € B(H) with the bounds 
m, and M;,m; < Mj,i = 1,...,n. For some 1 < n, < n, we denote m = min{my,...,mn,}, 
M = max{My,...,Mn,}. Let (p1,.--,Pn) be an n—tuple of non-negative numbers, such that 0 < 
iy Pi = Pm < Pa = Lin Pi If 


(m,M) Q [m;, Mj] = ©, i=n,+1,...,n 


and one of two equalities 


1s 1s 1 yh 
— Aj = — Aj = A 
Pm j=] ee n j=] i Pn — Pry jay a 
is valid, then 
1 ny i n 1 n 
— ) pif(Ai) < — ¥ pif (Ai) < ——— Yo pif(Ai (39) 
Poi j=} Pn j=} Pn ~ Pm jon 41 


holds for every continuous convex function f : I + IR provided that the interval I contains all mj, Mj, 
BHA, ose: 


If f : 1 + Ris concave, then the reverse inequality is valid in (39). 


As a special case of Corollary 19 we can obtain a discrete version of Corollary 7 as follows. 


Corollary 20 (Discrete version of Corollary 7). Let (A1,...,An) be an n—tuple of self-adjoint 
operators A; € B(H) with the bounds m; and Mj, mj < Mj,i = 1,...,n. Let (a4,...,0n) be an 
n—tuple of nonnegative real numbers such that 7", a; = 1. If 


(ma,M,)N [m;, Mj] = 2, i=1,...,n (40) 


where ma, and Ma,m, < Mza,are the bounds of A = a a; Aj, then 


i=1 


f (z wAi) < ¥ aif (Ai) (41) 
j=l 


holds for every continuous convex function f : I —> IR provided that the interval I contains all mj, Mj. 
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Proof. We prove only the convex case. We define (n + 1)—tuple of operators (By,...,By 41), 
B; € B(H), by B} = A = YL, 4;A; and B; = Aj_y,i = 2,...,n+1. Then mg, = mya, 
Mp, = Mz, are the bounds of By and mg, = mj_1, Mp, = Mj-1 are the ones of Bj, i = 
2,...,n +1. Also, we define (n + 1)— fae of non-negative numbers (p,..-,Py+1) by py =1 
and pj = aj-1,i1=2,...,n+1. Then a, 1 Pi = 2 and by using (40) we have 


EE SiS =@, i=2,...,n+1 (42) 
Since 
n+1 n+1 
L, PBs = B,+ de iB = La + Dads = 9B, 
then 
y ntl n+1 
piBy = ae piBi = — dL piB i (43) 


Taking into account (42) and (43), we can apo Gorcilaay 19 for ny = 1 and Bj, p; as above, 
and we get 
y ntl n+1 


pif ( By) < 7 Pit pif Bi) <b pif (Bi) 


which gives the desired inequality (41). 


6. Extension of the refined Jensen’s inequality 


There is an extensive literature devoted to Jensen’s inequality concerning different refinements 
and extensive results, see, for example [22-29]. 


In this section we present an extension of the refined Jensen’s inequality obtained in Section 4 
and a refinement of the same inequality obtained in Section 5. 
Theorem 21. Let (A1,...,An) be an n—tuple of self-adjoint operators A; € B(H) with the bounds 
m, and M;,m; < M,,i = 1,...,n. Let (®1,...,®,) be an n—tuple of positive linear mappings 
®; : B(H) > B(K), such that Y;"", ®j\(1H) = w1k, Vien 41 Pin) = B 1x, where 1 < ny <1, 
a,B > Oanda+ Bp =1. Let my, = min{m,...,mn,}, Mr = max{My,...,Mn,} and 

m = max{M;: M; < mz,i € {m +1,...,n}} 

M = min{m;: m; > Mr,i € {n, +1,...,n}} 
If 

(my,Mr)O[m;,M;| =O, i=ny,+1,...,n, and m<M 

and one of two equalities 


* yo o(A) = 5 ,(A;) = j 2 ;(Aj) 
i=1 i=1 i=n+1 
is valid, then 
: ¥ e(f(AD) < : Y P(F(AD) + pod < ye e(f(Ad) 


Ysa) 4) 
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holds for every continuous convex function f : I —> IR provided that the interval I contains all m;, Mj, 
i=1,...,n, where 


: n+ M 
5 = 5y(m, M1) = fm) + fm) ~2¢ (= 5") 
an ae 7 ny 7 / (45) 
A= Agomaltt,M) = sl ai a yo; (|a:- me) 
i=1 


and m€ [m,m,],M € [Mpr,M], m < M, are arbitrary numbers. If f : I > R is concave, then the 
reverse inequality is valid in (44). 


Proof. We prove only the convex case. Let us denote 
1 ny 1 n n 
= PA) 3 2° Ai), C= J (Aj) 
. i=l ~ By m+ i=1 


It is easy to verify that A = B or B= Cor A=Cimplies A= B=C., 
Since f is convex on [171, M] and Sp(A;) C [m;,M;] C [m,M] fori = 1,...,11, it follows from 
Lemma 11 that 


f (Ai) < SBA gm) 4 SEH pa) — 5p, f= 1.2m 


holds, where 6 = f(m) + f(M) — 2f (4%) and A; = }1y = Ai mM |, 
Applying a positive linear mapping ©; and summing, we obtain 


5h, & (f(A) < Melk = SA) 


Lae CONE i aya (46) 


where A = 31x — agi DI) ®; (|4i = mM |). 


Additionally, since f is convex on all [m;,M;] and (m,M) M [m;,M;] = @,i = 1, +1,...,n, 
then : 
M1yp — A; ,, _ Aj-m1y _ ; 
eae ee } = mor 

fay) > HA gem) 4 MM gy, mt. 
It follows 

1 2 oe MIG=F oc B= ily es 

zg. Pi(f(Ai)) — 5A = Mom ft) + samy SOM) — 8A (47) 
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Combining (46) and (47) and taking into account that A = B, we obtain 


1 ¥ @;(f(Ai) < : (F(A) — 5A (48) 
i=1 i=n+1 
Next, we obtain 
*Ee(f(Ai) 


= Mi 
SEO (AN)+ Le BFA) BHA (by (48)) 
< Ye BUFAD)—AbpA+ YO @i(F(Ai)) ~ Bap (by (48)) 
= 7 Ee eftay) ~6,A (by a+f=1) 


Adding Bd pA in the above inequalities, we get 
1 <£ ~ 
Lew )) + BofA < Los Aj)) < A. Yo i (f(Ai)) — 5A (49) 
i= 


Now, we remark that 5 if 20 and A > 0. (Indeed, since f is convex, then f ((1m-+ M)/2) < 
(f (im) + f(M))/2, which implies that 5¢ > 0. Also, since 


M+n vl — 
Sp(Aj) C [m,M]) = Ai er] se, i=1,...,m 
then ; 7 _ 
: 3 
yo; (cs mul) 2 M Ot 
L 2 
i=1 
which gives 
1 a M+m 
O<=l1k- = @, | |A 1 =A 
Kay (4 Al) = 4) 
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Consequently, the following inequalities 


1S @i(f(Ai)) < +S BL f(AD)) + Bd A 
i=1 i=1 

} yO (F(A)) —08,A< 41 Yo @,(F(A)) 
i=n+1 i=n+1 


hold, which with (49) proves the desired series inequalities (44). 


Example 22. We observe the matrix case of Theorem 21 for f(t) = t*, which is the convex function 
but not operator convex, n = 4, ny = 2 and the bounds of matrices as in Fig. 3. We show an example 


m, M.= m=m, m, M, M)=Mp M=m, M, 


Figure 3. An example a convex function and the bounds of four operators 


such that 


£ (®1(At) + ©2(A5)) < } (1 (Af) + 2(A})) + BdpA 
< @; (At) + &2(A5) + 3(A$) + &4(AG) (50) 
< 5 (®3(A3) + ®4(Af)) — ad pA < 5 (©3(A$) + &4(AQ)) 


holds, where 6¢ = M* + m* — (M+m)*/8 and 


~ 1 if M+m M+m 
A= -—!I = ® Ai - I D Ad — I 
sla ogy (#1 (lr - SE a) + 2 (142 - MF *aI)) 


We define mappings ®; : M3(C) — M(C) as follows: ®;((ajx)1<jn<3) = a (Vai<jkcr = 
1,...74, Then Te_, O;(15) = anda =p =}. 


Let 


2 9/81 2 9/80 4 1/21 5/3 1/20 
A, =2{9/8 2 0],A2=3]9/8 1 0}),A3=-3]1/2 4 0) ,Ag=12 1] 1/23/20 
1 0 3 0 O 1 0 2 0 0 3 


N 


Then my = 1.28607, M, = 7.70771, mz = 0.53777, Mz = 5.46221, m3 = —14.15050, M3 = 
—4.71071, mg = 12.91724, My = 36.,s0 mp = mz, Mr = My, m = Mz and M = mg (rounded to 
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five decimal places). Also, 


1 “41 (4 9/4 
1 (@1(A1) + &2(Aa)) = 5 (@a(4a) + 444) = (974 "5" 
and 
1 , 4) _ (989.0391 663.46875 
ee (®1(47) +@2(43)) ~ aa eee 
68093.14258 48477.98437 
= 4 4 4 4) _ 
Cp Pian) aaa eta) aay) oe Se 
1 , ay) _ (135197.28125  96292.5 
a= (@3(43) + 4(42)) = ( 96292.5 cas 
Then 


holds (which is consistent with (31)). 
We will choose three pairs of numbers (m,M), m € [—4.71071, 0.53777], M € [7.70771, 12.91724] 


as follows 
i) m= my, = 0.53777, M = Mr = 7.70771, then 


0.15678 0.09030\ — /231.38908 133.26139 
0.09030 0.15943) ~ \133.26139 235.29515 


ii) m=m = —4.71071, M = M = 12.91724, then 
0.36022 ae) = Gorin oo) 


Ay = Bo, A = 0.5 - 2951.69249 - ( 


es ia cd ce 0.36155 496.04498 5019.50711 
iii) m= —1, M = 10, then 

0.08975 0.27557) — \ 411.999 1265. 

New, we obtain the following improvement of (51) (see (50)) 


Roe p5-A — 0.5-9180.875- Gone een _ ee oo) 


i Ape ey ae Gaeen Bs 


796.73014 761.42406 

< Cy < (13496589217 9615923861) _, x 2 

f 96159.23861 101909.36110/ ~f “lf 

* 5989.90251 1159.51373 

ii) Ap <Arpthy = 

a Me ie Gate oa 

< C, < (130196.38265 95796.45502 

f 95796.45502 97125.14914 
2283.66362 1075.46746 
1075.46746 1791.12874 


) = By— Ba < By 


iii) Ap < Ap +A3 = ( 
< C, < (133902.62153 95880.50129) _, 5 
‘j 95880.50129 100879.65641) ~“f “3 ~°f 
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Using Theorem 21 we get the following result. 


Corollary 23. Let the assumptions of Theorem 21 hold. Then 


+L ei(fA)) < TL esa) +nsAsz VY, efi) (52) 
i=1 i=1 i=n+1 
and n n n 
TS ofA) <a YL Oi(f(AD) — 15/4 <4 (F(A) (53) 
eee B jd B i=ny+1 


holds for every 1, ‘Y2 in the close interval joining a and B, where 5¢ and A are defined by (45). 


Proof. Adding «é A in (44) and noticing 6 pA > 0, we obtain 

1 ny 1 ny ms 1 n 
a d, e(F(49)) ae , Pi(F(Ai)) +adsA< = Yi (f(A) 
1= [= 


i=n+1 


Taking into account the above inequality and the left hand side of (44) we obtain (52). 


Similarly, subtracting Bd pA in (44) we obtain (53). 


Remark 24. We can obtain extensions of inequalities which are given in Remark 16 and 17. Also, we 
can obtain a special case of Theorem 21 with the convex combination of operators A; putting ®;(B) = 
«;B, fori = 1,...,n, similarly as in Corollary 19. Finally, applying this result, we can give another 
proof of Corollary 14. The interested reader can see the details in [30]. 
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Chapter 10 


A Linear System of Both Equations 
and Inequalities in Max-Algebra 


Abdulhadi Aminu 


Additional information is available at the end of the chapter 


http://dx.doi.org/10.5772/48195 


1. Introduction 


The aim of this chapter is to present a system of linear equation and inequalities in 
max-algebra. Max-algebra is an analogue of linear algebra developed on the pair of operations 
(®,®) extended to matrices and vectors, where a®b = max(a,b) anda@®b = a+b for 
a,b € R. The system of equations A ® x = c and inequalities B® x < d have each been studied 
in the literature. We will present necessary and sufficient conditions for the solvability of a 
system consisting of these two systems and also develop a polynomial algorithm for solving 
max-linear program whose constraints are max-linear equations and inequalities. Moreover, 
some solvability concepts of an inteval system of linear equations and inequalities will also be 
presented. 


Max-algebraic linear systems were investigated in the first publications which deal with 
the introduction of algebraic structures called (max,+) algebras. Systems of equations with 
variables only on one side were considered in these publications [1, 2] and [3]. Other systems 
with a special structure were investigated in the context of solving eigenvalue problems in 
correspondence with algebraic structures or synchronisation of discrete event systems, see [4] 
and also [1] for additional information. Given a matrix A, a vector b of an appropriate size, 
using the notation © = max, ® = plus, the studied systems had one of the following forms: 
A®x=b,A®x=xorA@®x=x@b. An infinite dimensional generalisation can be found 
in [5]. 


In [1] Cuninghame-Green showed that the problem A ® x = b can be solved using residuation 
[6]. That is the equality in A ® x = b be relaxed so that the set of its sub-solutions is studied. 
It was shown that the greatest solution of A ® x < b is given by X where 

= . = ‘ 

R= min (bj @ aj ) forallj ¢ N 
The equation A ® x = bis also solved using the above result as follows: The equation A ® x = 


b has solution if and only if A ® x = b. Also, Gaubert [7] proposed a method for solving the 
one-sided system x = A ® x © b using rational calculus. 
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Zimmermann [3] developed a method for solving A ®x = b by set covering and also 
presented an algorithm for solving max-linear programs with one sided constraints. This 
method is proved to has a computational complexity of O(mn), where m,n are the number 
of rows and columns of input matrices respectively. Akian, Gaubert and Kolokoltsov [5] 
extended Zimmermann’s solution method by set covering to the case of functional Galois 
connections. 


Butkovic [8] developed a max-algebraic method for finding all solutions to a system of 
inequalities x; — xj > bij,1, j = 1,..,n using n generators. Using this method Butkovic 
[8] developed a pseudopolynomial algorithm which either finds a bounded mixed-integer 
solution, or decides that no such solution exists. Summary of these results can be found in [9] 
and [10] 


Cechlarova and Diko [11] proposed a method for resolving infeasibility of the system A ® x = 
b . The techniques presented in this method are to modify the right-hand side as little as 
possible or to omit some equations. It was shown that the problem of finding the minimum 
number of those equations is NP-complete. 


2. Max-algebra and some basic definitions 


In this section we introduce max-algebra, give the essential definitions and show how the 
operations of max-algebra can be extended to matrices and vectors. 


In max-algebra, we replace addition and multiplication, the binary operations in conventional 
linear algebra, by maximum and addition respectively. For any problem that involves adding 
numbers together and taking the maximum of numbers, it may be possible to describe it 
in max-algebra. A problem that is nonlinear when described in conventional terms may be 
converted to a max-algebraic problem that is linear with respect to (@, ®) = (max, +). 


Definition 1. The max-plus semiring R is the set RU {—co}, equipped with the addition 
(a,b) ++ max(a,b) and multiplication (a,b) + a+b denoted by © and ® respectively. That 
isa @b = max(a,b) anda®b=a-+b. The identity element for the addition (or zero) is —co, 
and the identity element for the multiplication (or unit) is 0. 


Definition 2. The min-plus semiring Rmin is the set RU {+00}, equipped with the addition 
(a,b) ++ min(a,b) and multiplication (a,b) + a+b denoted by ©’ and @’ respectively. The 
zero is +00, and the unit is 0. The name tropical semiring is also used as a synonym of min-plus 
when the ground set is IN. 


The completed max-plus semiring Riax is the set IRU {+00}, equipped with the addition 
(a,b) ++ max(a,b) and multiplication (a,b) +> a + b, with the convention that —oo + (+00) = 
+00 + (—co) = —co, The completed min-plus semiring Ryin is defined in the dual way. 


Proposition 1. The following properties hold for all a,b,c € R: 
agb=bea 
a®b=bea 

a@(b@c)=(a@b)@c 
a®(b@c)=(a@b)@c 
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a®(b@c)=aSbeaec 


a@® (—00) = —co = (—0) Ga 
a@0=a=08a 


a@a! =0,4,a ! ER 


Proof. 
The statements follow from the definitions. 


Proposition 2. For all a,b,c € IR the following properties hold: 
ax<b=a0@c<b@c 
ax<b&=a®8c<b@ccER 
ax<b==a@eb=b 


a>b<=a®c>b®&c -w~<c< +o 


Proof. The statements follow from definitions. 


The pair of operations (@,®) is extended to matrices and vectors as in the conventional linear 
algebra as follows: For A = (a,j), B = (b;j) of compatible sizes and a € IR we have: 


A@B= (aij © bij) 
= ® 
A@®B= bs a9) 
k 


a @ A = (a @ aij) 


Example 1. 
18)0(-4.82)- (248) 
Example 2. 
=a N= lg a 
( 30 : (: ) 
=(“Br(ewrewss @se0ree+ )~ (ns) 


Example 3. 


782 77 
ne ¢ 7 = & a io) 
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Proposition 3. 
For A, B,C € R”*” of compatible sizes, the following properties hold: 


A®B=BOA 
A® (BOC) =(A@B)OC 
A®(B@C)=(A@B)@C 
A®(BOC)=ABBEABC 
(A®B)@®C=A@C@BEC 


Proof. 
The statements follow from the definitions. 


Proposition 4. 

The following hold for A, B,C, a,b,c, x,y of compatible sizes and a, B € R: 
A®(a®B)=a@(A@B) 
a®(APB)=x@AGaGB 
(aS B)®A=a@ASHOB 

x @a@y=a@x' @y 
a<b=c' @aK<c' ob 
A<B=AO6C<BOC 
A<B=A®8CK<B@eC 
A<Be>A@GB=B 


Proof. The statements follow from the definition of the pair of operations (®,®). 
Definition 3. Given real numbers a,b,c,...,a max-algebraic diagonal matrix is defined as: 


a 


diag(a,b,c,...) = 


Given a vector d = (dj,dz,...,dyn), the diagonal of the vector d is denoted as diag(d) = 
diag (d1,d>, sien dn). 


Definition 4. Max-algebraic identity matrix is a diagonal matrix with all diagonal entries zero. 
We denote by I an identity matrix. Therefore, identity matrix I = diag(0,0,0,...). 
It is obvious that A ® I = I @ A for any matrices A and I of compatible sizes. 


Definition 5. Any matrix that can be obtained from the identity matrix, I, by permuting its 
rows and or columns is called a permutation matrix. A matrix arising as a product of a diagonal 
matrix and a permutation matrix is called a generalised permutation matrix [12]. 
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Definition 6. A matrix A € R"*” is invertible if there exists a matrix B € Rr”, such that 
A®B = B®@A =I. The matrix B is unique and will be called the inverse of A. We will 
henceforth denote B by At, 


It has been shown in [1] that a matrix is invertible if and only if it is a generalised permutation 


matrix. If x = (x1,...,Xn) we will denote x —_— (44 ee ma that is x! = —x, in 


conventional notation. 


Example 4. 
Consider the following matrices 
—oo —0o 3 —oo —5 —0o 
A= 5 —co —oo |] and B= | —co —oo —8 
—oo 8 —oo —3 —0o —oo 


The matrix B is an inverse of A because, 


—co-co 3 —co —5 —oo 0 —co —oo 
AQB= 5 —co —co |] @ co —co 8] = fore) 0 —oo 
—oo 8 —co 3 —0co —oo —co-oco 0 


Given a matrix A = (a,j) € R, the transpose of A will be denoted by A’, that is AT = (aji). 
Structures of discrete-event dynamic systems may be represented by square matrices A over 
the semiring: 

R = ({-—co} UR, @, ®) = ({—co} UR, max, +) 
The system ¥t is embeddable in the self-dual system: 


R = ({-co} UR{+00},@,®,©,®) = ({-co} UR{+0o}, max, +, min, +) 
Basic algebraic properties for 6’ and @ are similar to those of © and @ described earlier. 
These are obtained by swapping < and > . Extension of the pair (@', ®') to matrices and 
vectors is as follows: 
Given A, B of compatible sizes and a € R, we define the following: 


A® B= (aj @ bi) 


A® B= ty ain @ mi) = min (ax + by) 
k 
“a@ A=(a® aij) 
Also, properties of matrices for the pair (e, @) are similar to those of (©, ®), just swap 


< and >. For any matrix A = [a;;] over R, the conjugate matrix is A* = [—a;j] obtained by 
negation and transposition, that is A = —Al, 


Proposition 5. The following relations hold for any matrices U, V, W over R. 


(U@® V)@W<U®@ (VEW) (1) 
U@(U*@ W)<W (2) 
U®@(U*@ (U@W)) =UE@W (3) 


Proof. Follows from the definitions. 
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3. The Multiprocessor Interactive System (MPIS): A practical application 


Linear equations and inequalities in max-algebra have a considerable number of applications, 
the model we present here is called the multiprocessor interactive system (MPIS) which is 
formulated as follows: 


Products P,,...,Pm are prepared using n processors, every processor contributing to the 
completion of each product by producing a partial product. It is assumed that every processor 
can work on all products simultaneously and that all these actions on a processor start as soon 
as the processor is ready to work. Let aj be the duration of the work of the il processor 
needed to complete the partial product for P; (i = 1,...,m;j = 1,...,n). Let us denote 
by xj the starting time of the jf" processor (j = 1,...,m). Then, all partial products for P; 
(i=1,...,m;j =1,...,n) will be ready at time max(aj + %1,...,din + Xn). If the completion 
times b1,...,bm are given for each product then the starting times have to satisfy the following 
system of equations: 


max(aj1 + X1,..-,@in + Xn) = b; for allie M 


Using the notation a © b = max(a,b) anda ®b=a-+b fora,b € R extended to matrices and 
vectors in the same way as in linear algebra, then this system can be written as 


A®Sx=b (4) 


Any system of the form (4) is called ‘one-sided max-linear system’. Also, if the requirement 
is that each product is to be produced on or before the completion times by,..., bm, then the 
starting times have to satisfy 


max(aj1 + X1,..-,@in + Xn) < b; for allie M 


which can also be written as 
A@x<b (5) 


The system of inequalities (5) is called ‘one-sided max-linear system of inequalities’. 


4, Linear equations and inequalities in max-algebra 


In this section we will present a system of linear equation and inequalities in max-algebra. 
Solvability conditions for linear system and inequalities will each be presented. A system 
consisting of max-linear equations and inequalities will also be discussed and necessary and 
sufficient conditions for the solvability of this system will be presented. 


4.1. System of equations 


In this section we present a solution method for the system A ® x = DB as given in [1, 3, 13] 
and also in the monograph [10]. Results concerning the existence and uniqueness of solution 
to the system will also be presented. 


pmxn 


Given A = (a;;) ER and b = (by,...,bm)’ € R", a system of the form 


A@x=b (6) 
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is called a one-sided max-linear system, some times we may omit ’max-linear’ and say one-sided 
system. This system can be written using the conventional notation as follows 


Ras (aij + xj) =b,iEM (7) 
j=l..m 


The system in (7) can be written after subtracting the right-hand sides constants as 


max (aj @ b; + +xj)=0,1EeM 
j=1,...m 


A one-sided max-linear system whose all right hand side constants are zero is called normalised 
max-linear system or just normalised and the process of subtracting the right-hand side constants 
is called normalisation. Equivalently, normalisation is the process of multiplying the system (6) 
by the matrix B’ from the left. That is 


B @A@x=B @b=0 


where, ; 
B = diag(b,',by',...,b;,') = diag(b~') 


For instance, consider the following one-sided system: 


713 xy 5 
302)/@([x)=| 6 (8) 
127 x3 3 


After normalisation, this system is equivalent to 


-7 -4-2 x4 0 
—3-6-4]@{x]={0 
—2—-1-2 x3 0 
That is after multiplying the system (8) by 
—5 —co —0o0 


—co —6 —c 
—co —oo —3 


Consider the first equation of the normalised system above, that is max(x, — 7,x2 — 4,x3 — 
2) = 0. This means that if (x1,x2,x3)' is a solution to this system then x; < 7,x2 < 4, 
x3 < 2 and at least one of these inequalities will be satisfied with equality. From the other 
equations of the system, we have for x1 < 3, x; < 2, hence we have x, < min(7,3,2) = 

max(—7,—3,—2) = —X, where —%, is the column 1 maximum. It is clear that for all j then 
xj < x;, where —X; is the column j maximum. At the same time equality must be attained in 
some of these inequalities so that in every row there is at least one column maximum which 
is attained by x i: This observation was made in [3]. 


Definition 7. A matrix A is called doubly R-astic [14, 15], if it has at least one finite element on 
each row and on each column. 
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We introduce the following notations 
S(A,b) ={x €R";A@x=)} 
Mj = {k © M;by @ a;) = max(b; ® a;;')} for all j ¢ N 
t 
< on, SS -1 . 
£(A,b)) = min(b; Qa; ) for all j € N 


We now consider the cases when A = —oco and/or b = —oo. Suppose that b = —oo. Then 
S(A, b) can simply be written as 


S(A,b) = {x € R"; x; = —00, if Aj # —00, j € N} 


Therefore if A = —co we have S(A,b) = R" . Now, if A = —coand b 4 —o then S(A,b) = ©. 
Thus, we may assume in this section that A = —co and b # —ov. If by = —oo for some 
k € M then for any x € S(A,b) we have x; = —co if a; # —co, j € N, asa result the kth 
equation could be removed from the system together with every column j in the matrix A 
where aj; # —oo (if any), and set the corresponding x; = —co. Consequently, we may assume 
without loss of generality that b € R”. 


Moreover, if b € IR” and A has an —0oo row then S(A,b) = ©. If there is an —co column j in A 
then x; may take on any value in a solution x. Thus, in what follows we assume without loss 
of generality that A is doubly IR — astic and b € IR”. 


 pwMmNxn 


Theorem 1. Let A = (a;j) € R 
only if 


be doubly R — astic and b € IR". Then x € S(A,b) if and 


i) x < x(A,b) and 
ii) (J) Mj = M where Nx = {j € N; x; = £(A,b);} 
jENy 
Proof. Suppose x € S(A,b). Thus we have, 
A®S®x=b 
<=> max(aj + x;) = 0; for allie M 
J 
=> 4 + x; = b; for somej € N 
SS xj < b; @ aj for alli EM 
: =] ; 
SS min(b; @ aj ) for allj € N 
Hence, x < *. 
Now that x € S(A,b). Since Mj; © M we only need to show that M C Uien, Mj. Let ke M. 
Since bj, = aj ® x; > —oo for some j € N and a = ve 2 jj, ® bt for every i € M we have 


oe 
j 


Suppose that x < x and Ujen, Mj = M. Letk € M,j € N. Then aj; ® xj < dg if a4; = —00. If 
aj # —oo then 


x Ak ® i = maxjcy jj ® be Hence k € Mj and x; = 5;. 


Aj @ Xj S Mpj@ Fj S yj @ be Baz) = dy (9) 
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Therefore A ® x < b. At the same time k € Mj for some j € N satisfying x; = %;. For this j 
both inequalities in (9) are equalities and thus A @ x = b. 


The following is a summary of prerequisites proved in [1] and [12]: 
Theorem 2. Let A = (ajj) € R”*" be doubly R — astic and b € R™. The system A @ x = b 


has a solution if and only if (A, b) is a solution. 


Proof. 
Follows from Theorem 1. 


Since ¥(A,b) has played an important role in the solution of A ® x = b. This vector £ is called 
the principal solution to A®x = b [1], and we will call it likewise. The principal solution 
will also be used when studying the systems A ® x < b and also when solving the one-sided 
system containing both equations and inequalities. The one-sided systems containing both 
equations and inequalities have been studied in [16] and the result will be presented later in 
this chapter. 


Note that the principal solution may not be a solution to the system A ® x = b. More precisely, 
the following are observed in [12]: 


pmxn 


Corollary 1. Let A = (aj) € R 
statements are equivalent: 


be doubly IR — astic and b € IR. Then the following three 


i) S(A,b) FD 
ii) x(A,b) € S(A,b) 
iii) UMj=M 
jEN 
Proof. 
The statements follow from Theorems 1 and 2. 


For the existence of a unique solution to the max-linear system A @®x = b we have the 
following corollary: 


pnNxn 


Corollary 2. Let A = (aj) € R’ ~~ be doubly R — astic and b € IR". Then S(A,b) = {x(A,b)} 
if and only if 


i) U Mj = Mand 
jEN 
ii) LU) Mj; £M forany N’CN,N' AN 
jEN 
Proof. 
Follows from Theorem 1. 


The question of solvability and unique solvability of the system A ® x = b was linked to the 
set covering and minimal set covering problem of combinatorics in [12]. 
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4.2. System of inequalities 


In this section we show how a solution to the one-sided system of inequalities can be obtained. 
Let A = (aj) € R™*" and b = (by,...,bm)7 CR. A system of the form: 


AQx<b (10) 


is called one-sided max-linear system of inequalities or just one-sided system of inequalities. The 
one-sided system of inequalities has received some attention in the past, see [1, 3] and [17] for 
more information. Here, we will only present a result which shows that the principal solution, 
X(A,b) is the greatest solution to (10). That is if (10) has a solution then %(A,b) is the greatest 
of all the solutions. We denote the solution set of (10) by $(A,b, <). That is 


S(A,b,<) = {x € R"; A@x <b} 
Theorem 3. x € S(A,b, <) if and only if x < £(A,b). 


Proof. Suppose x € S(A,b, <). Then we have 


A@x<b 


<= max(ajj + xj) < bj for alli 
J 


SS Ajpt+xj< b; for alli, j 
SH b; ® a;;! for alli, j 
—= x < min(b; @ a;;') for all j 


<— x < #(A,b) 


and the proof is now complete. 


The system of inequalities 


A@x<b 


11 
Cex>d a 
was discussed in [18] where the following result was presented. 


Lemma 1. A system of inequalities (11) has a solution if and only if C @ x(A,b) >d 


4.3. A system containing of both equations and inequalities 

In this section a system containing both equations and inequalities will be presented, the 
results were taken from [16]. Let A = (ajj) € Re" C= (cj) ER", b= (by,...,bg)? € RK 
and d = (dy,...,d;y)" € IR". A one-sided max-linear system with both equations and inequalities is 
of the form: 
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A®x=b 
(12) 
C@x<d 
We shall use the following notation throughout this paper 
R= {1,2,...,7} 
S(A,C,b,d) = {x € R";A@x=bandC@x < d} 
S(C,d,<) ={x E R";C@x < d} 
£;(C,d) = min(d; @ c;;') for all j € N 
icR - 
K = {1,...,k} 
: 1 oe : =—1 i 
j= {ke Kib, @ a5 = min (5,@0;")} forall jeN 
7 dest ; = : 
¥(A,b) = min(b; @aj ) for all j e N 
% = (X1,...,¥n)? 
J ={j © N;5,(C,d) > ¥;(A,b)} and 
L=N\J 
We also define the vector % = (21, £0,...,2n)", where 
2(A.b) if; 
)(A,Cib,d) = AAP) HET (13) 
¥j(C,d) if j EL 


and Ng = {j SC N; 4; = %i he 


Theorem 4. Let A = (aj) € RC = (a) = RO = Bijanghp) R* and d = 
(d1,..., dy)? € IR". Then the following three statements are equivalent: 


(i) S(A,C,b,d) #@ 
(ii) (A,C,b,d) € S(A,C,b,d) 
(iii) JK; =K 


je 


Proof. (i) ==> (ii). Let x € S(A,C,b,d), therefore x € S(A,b) and x € S(C,d,<). Since 
x € S(C,d,<), it follows from Theorem 3 that x < <(C,d). Now that x € S(A,b) and also 
x € S(C,d,<), we need to show that x;(C,d) > %;(A,b) for all j € Nx (that is Ny C J). 
Let j € Ny then x; = x;(A,b). Since x € S(C,d,<) we have x < x(C,d) and therefore 
x;(A,b) < ¥(C,d) thus j € J. Hence, Ny C J and by Theorem 1 Ujey Kj = K. This also proves 
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(i) => (iii) 
(iii) => (i). Suppose Ujes Kj = K. Since £(A,C,b,d) < x(C,d) we have £(A,C,b,d) € 
S(C,d,<). Also £(A,C,b,d) < ¥(A,b) and Ng D J gives Ujensacea Ki 2 Ujes Kj = K- Hence 
enue K; = K, therefore £(A,C,b,d) € S(A,b) and £(A,C,b,d) € S(C,d,<). Hence 
&(A,C,b,d) € S(A,C,b,d) (that is S(A,C,b,d) # ©) and this also proves (iii) ==> (ii). 


Theorem 5. Let A = (a) € R*", C = (cj) € R'™", b = (b1,...,)7 € R* andd = 
(dy,...,dy)? € IR". Then x € S(A,C,b,d) if and only if 


(i) x < £(A,C,b,d) and 


(ii) UU Kj = K where N, = {j € N; xj = x;(A,b)} 
jJENx 


Proof. (=>) Let x € S(A,C,b,d), then x < *(A,b) and x < x(C,d). Since £(A,C,b,d) = 
X(A, b) Q X(C,d) we have x < £(A,C,b,d). Also, x € S(A,C,b,d) implies that x € S(C,d,<). 
It follows from Theorem 1 that Ujen, Kj = K. 

(<=) Suppose that x < £(A,C,b,d) = x(A,b) Q X(C,d) and Ujen, Kj = K. It follows from 
Theorem 1 that x € S(A,b), also by Theorem 3 x € S(C,d,<). Thus x € S(A,b) NS(C,d,<) = 
S(A,C,b,d). 


We introduce the symbol |X| which stands for the number of elements of the set X. 


Lemma 2. Let A = (aj) € Reo = (yy. RA", b= (Biasgtel? R* and d = 
(dy,...,dy)? € IR". If |S(A,C,b,d)| = 1 then |S(A,b)| = 1. 


Proof. Suppose |S(A,C,b,d)| = 1, that is S(A,C,b,d) = {x} for an x € R"”. Since 
S(A,C,b,d) = {x} we have x € S(A,b) and thus S(A,b) # @. For contradiction, suppose 
|S(A,b)| > 1. We need to check the following two cases: (i) L # © and (ii) L = © where 
L = N \ J,and show in each case that |S(A,C,b,d)| > 1. 


Proof of Case (i), that is L A ©: Suppose that L contains only one element say n € N ie 
L = {n}. Since x € S(A,C,b,d) it follows from Theorem 4that £(A,C,b,d) € S(A,C,b,d). 
That is x = £(A,C,b,d) = 

(%1 (A, b), f(A, b),...,Xn-1(A,b), Fn(C,d)) € S(A,C,b,d). It can also be seen that, X(C,d)n < 
¥,(A,b) and any vector of the form z = 

(%1(A,b), %2(A,b),...,%,-1(A,b),a) € S(A,C,b,d), where «a < X,(C,d). Hence 
|S(A,C,b,d)| > 1. If L contains more than one element, then the proof is done in a similar 
way. 


Proof of Case (ii), that is L = © (J = N): Suppose that J = N. Then we have £(A,C,b,d) = 
%(A,b) < %(C,d). Suppose without loss of generality that x,x € S(A,b) such that x # 
x’. Then x < %(A,b) < (C,d) and also x < %(A,b) < x(C,d). Thus, x,x € S(C,d,<). 
Consequently, x, x € S(A,C,b,d) and x # x. Hence |S(A,C,b,d)| > 1. 


Theorem 6. Let A = (ajj) € RX" C= (cj) € R*",b = (by,...,b,)? € R* and d = 
(dy,...,dy)? € IR". If |S(A,C,b,d)| =1 then J = N. 
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Proof. Suppose |S(A,C,b,d)| = 1. It follows from Theorem 4 that Uje; Kj = K. Also, 
|S(A,C,b,d)| =1 cee that |S (A, b)| = 1 (Lemma 2). Moreover, |S(A, b)| = a a i enptics that 
Ujen Kj = K and Vien’ & K,N' CN,N N (Theorem 2). Since J C N and Ujey Kj = K, 
we have ] = N 


Corollary 3. Let A = (a) € Rk", C = (cj) € R™", b = (by,...,%)7 € R¥ andd = 
(dy,...,dy)? € IR". If |S(A,C,b,d)| = 1 then S(A,C,b,d) = {x(A,b)}. 


Proof. The statement follows from Theorem 6 and Lemma 2. 


Corollary 4. Let A = (a) € Rk*", C = (cj) € R™", b = (by,...,4)7 € R¥ andd = 
(dy,... dy)? € R*. Then, the following three statements are equivalent: 


(i) |S(A,C,b,d)| =1 
(ii) |S(A,b)| =1andJ =N 


(iii) ( Kj = Kand |) Kj #K, forevery JC J, J AJ, andJ =N 
je] ics’ 


Proof. (i) ==> (ii) Follows from Lemma 2 and Theorem 6. 
(ii) = > (i) Let J = N, therefore ¥ < X(C,d) and thus S(A,b) C S(C,d,<). Therefore we 
have S(A,C,b,d) = S(A,b) NS(C,d,<) = S(A,b). Hence |S(A,C,b,d)| =1. 

(ii) => (iii) Suppose that S(A,b) = {x} and J = N. It follows from Theorem 2 that 
Ujen Kj = K and Vien’ Kj #K, N CNN £N. Since J = N the statement now follows 
from Theorem 2. 

(iii) => (ii) It is immediate that J = N and the statement now follows from Theorem 2. 


Theorem 7. Let A = (aj) € RC = (aye Rb = Bisa eke)” IR‘ and d = 
(dy,...,dr)? € IRK. If |S(A,C,b,d)| > 1 then |S(A,C,b, d)| is infinite . 


Proof. Suppose |S(A,C,b,d)| > 1. By Corollary 4 we have Ujc; Kj = K, for some J C N, 
J] # N(that is Sj € N such that ¥;(A,b) > x;(C,d)). Now J C N and Uje; Kj = K, Theorem 5 
implies that any vector x = (Xj, X2,..., xn)? of the form 


— fee ifjeJ 
J] y<%(Cd) iffeL 


is in S(A,C,b,d), and the statement follows. 


Remark 1. From Theorem 7 we can say that the number of solutions to the one-sided system 
containing both equations and inequalities can only be 0,1, or oo. 


The vector *(A,C,b,d) plays an important role in the solution of the one-sided system 
containing both equations and inequalities. This role is the same as that of the principal 
solution ¥(A,b) to the one-sided max-linear system A ® x = b, see [19] for more details. 
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5. Max-linear program with equation and inequality constraints 


Suppose that the vector f = (fj, fo,.-,fn)' € IR” is given. The task of minimizing 
[maximizing]the function f(x) = f? @x = max(f, + %1, fi + X2.-, fn + Xn) subject to (12) 
is called max-linear program with one-sided equations and inequalities and will be denoted 
by MLP2in and [MLP2**]. We denote the sets of optimal solutions by s™in(A,C,b,d) and 
smax(A,C,b,d), respectively. 


Lemma 3. Suppose f € IR” and let f(x) = f! @ x be defined on R". Then, 
(i) f (x) is max-linear, ie. f(AQx Ou By) =A® f(x) @u® f(y) for every x,y € R”. 
(ii) f(x) is isotone, i.e. f(x) < f(y) for every x,y € R",x <y. 


Proof. 
(i) Let w € IR. Then we have 


fASx@u@y) =f  @aA@x@fi @Qpneoy 


=A@fl@x@u@f' ey 
=A® f(x) Spe fly) 
and the statement now follows. 
(ii) Let x, y € IR” such that x < y. Since x < y, we have 
max(x) < max(y) 
<> flex< f' @y, forany, f € R" 
= f(x) s fly). 


Note that it would be possible to convert equations to inequalities and conversely but this 
would result in an increase of the number of constraints or variables and thus increasing the 
computational complexity. The method we present here does not require any new constraint 
or variable. 


We denote by 
(A®x);= as + xj) 


A variable x; will be called active if x; = f(x), for some j € N. Also, a variable will be called 
active on the constraint equation if the value (A @ x); is attained at the term x; respectively. It 
follows from Theorem 5 and Lemma 3 that £(A,C,b,d) € S™*(A,C,b,d). We now present a 
polynomial algorithm which finds x € s™n(A,C,b, d) or recognizes that s™n(A,B,c,d) = @. 
Due to Theorem 4 either (A,C,b,d) € S(A,C,b,d) or S(A,C,b,d) = D. Therefore, we assume 
in the following algorithm that S(A,C,b,d) # © and also s™n(A,C,b,d )A@. 


Theorem 8. The algorithm ONEMLP-E] is correct and its computational complexity is O((k + 
r)n?). 
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Algorithm 1 ONEMLP-EI(Max-linear program with one-sided equations and inequalities) 


Input: f = (f,, far-r fa)? € IR", b = (by, bo, bg)? € IRE, d = (dy, do,..dy)? € IR", A = (ay) € 
RE" and C= (e,) € R. 

Output: x € S™"(A,C,b,d). 
1. Find #(A,b), #(C,d), £(A,C,b,d) and Kj, j € JiJ = {j € N;X;(C,d) > x)(A,b)} 


2. x:= £(A,C,b,d) 

3. H(x):= {jE Ni fj +xj =f(x)} 

4. [= J\H(3) 

5. If 
UK #K 
icJ 


then stop (x € S™"(A,C,b,d)) 


6. Set x; small enough (so that it is not active on any equation or inequality) for every 
j € H(x) 
7. Go to3 


Proof. The correctness follows from Theorem 5 and the computational complexity is 
computed as follows. In Step 1 ¥(A,b) is O(kn), while x(C,d), £(A,C,b,d) and Kj can be 
determined in O(rn), O(k + r)n and O(kn) respectively. The loop 3-7 can be repeated at most 
n — 1 times, since the number of elements in J is at most n and in Step 4 at least one element 
will be zoved at a time. Step 3 is O(n), Step 6 is O(kn) and Step 7 is O(n). Hence loop 3-7 
is O(kn*). 


5.1. An example 


Consider the following system max-linear program in which f = (5,6, 1,4, —1)', 


38 4 01 7 
A=[{06 2 21]),b=|{5], 
01-248 7 
—-12-306 5 
C=| 3 4-221] andd=]{ 5). 
13-234 6 


We now make a record run of Algorithm ONEMLP-EL. #(A,b) = (5, —1,3,3, —1)', x(C,d) = 
(2,1,7,3, -1)7, 2(A,C,b,d) = (2,-1,3,3,-1)", J = {2,3,4,5} and Ky = {1,2}, K3 = {1,2}, 
Ky = {2,3} and Ks = {3}. x := #(A,C,b,d) = (2,-1,3,3,-1)! and H(x) = {1,4} and 
J Z H(x). We also have J := J \ H(x) = {2,3,5} and Ky U K3 UKs = K. Then set x1 = x4 = 
10-4 (say) and x = (10-4, -1,3,10-+, -1)?. Now H(x) = {2} and J := J\ H(x) = {3,5}. 
Since K3U Ks = K set x2 = 10~4(say) and we have x = (10~4,10~-4,3,10-4,—-1)". Now 
H(x) = {3} and J := J \ H(x) = {5}. Since Ks # K then we stop and an optimal solution is 
= (10 * 10-340 = 1)" and FO =. 
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6. A special case of max-linear program with two-sided constraints 


Suppose c = (c1,C2,...,Cm)!,d = (dy,do,...,dm)? € R",A = (a,j) and B = (bj) € IR™*" are 
given matrices and vectors. The system 


A®x@c=Bexod (14) 


is called non-homogeneous two-sided max-linear system and the set of solutions of this 
system will be denoted by S. Two-sided max-linear systems have been studied in [20], [21], 
[22] and [23]. 


Optimization problems whose objective function is max-linear and constraint (14) are called 
max-linear programs (MLP). Max-linear programs are studied in [24] and solution methods 
for both minimization and maximization problems were developed. The methods are proved 
to be pseudopolynomial if all entries are integer. Also non-linear programs with max-linear 
constraints were dealt with in [25], where heuristic methods were develeoped and tested for 
a number of instances. 


Consider max-linear programs with two-sided constraints (minimization), MLP™n 

f(x) =f? @x — min 

subject to (15) 

ASx@c=Be@xed 
where f = (fi,.-.,fa)’ € RY", ¢ = (¢1y..<,6m)", d = (d1,...,dm)" € BR”, A = (ay) and 
B = (b;;) € IR"*” are given matrices and vectors. We introduce the following: 

y= (fi @ X1, fo @ X2,..-, fn @ Xn) 
= diag(f) ® x 


diag(f) means a diagonal matrix whose diagonal elements are f;, fo,..., fn and off diagonal 
elements are —oo. It therefore follows from (16) that 


(16) 


fle@x=0' @y 
= x= (f[' Oy. fy! @y2---.fa @Yn) (17) 
= (diag(f)) " @y 
Hence, by substituting (16) and (17) into (15) we have 
07 @y — min 
subject to (18) 
A @y@c=B @yod, 
where 07 is transpose of the zero vector, A’ = A @ (diag(f))~! and B’ = B® (diag(f))~! 
Therefore we assume without loss of generality that f = 0 and hence (15) is equivalent to 
fro= © x; — min 
j=l 


subject to 


A®x@c=BO@xeGd 
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The set of feasible solutions for (19) will be denoted by S and the set of optimal solutions 
by s™n_ A vector is called constant if all its components are equal. That is a vector x € IR” is 
constant if x1 = x7 =--- = Xy. For any x € S we define the set Q(x) = {i€ M;(A®x); > cj}. 
We introduce the following notation of matrices. Let A = (aj) € R™", 1 <i, <in < ++ < 
ig <mand1 < jy < jo <-+-+ < jy <n. Then, 


Gin jy Ai jo oreee Ain ji, 
A (rare) a | Shih thi | = A(Q.R) 
Ji J2r-++s Jr Sng 
Ain jy Fin jon edie Gini, 
where, Q = {i1,...,ig}, R = {j1,...,jr}. Similar notation is used for vectors c(i1,...,ir) = 
(cj, ---¢;,)? = c(R). Given MLP™" with c > d, we define the following sets 


M~ = {ie M;c; = d;} and 
M> = {i € M;¢; > dj} 
We also define the following matrices: 
A= = A(M-,N), As = A(M7,N) 
B. = B(M-,N), Bs = B(M7,N) (20) 
c= =c(M7),cs =c(M7) 
An easily solvable case arises when there is a constant vector x € S such that the set Q(x) = @. 
This constant vector x satisfies the following equations and inequalities 
A~@x< ce 
As @x<ces 
B~@x<c= 


By @x=cy 


(21) 


where A=, As, B=, Bs, c= and cy are defined in (20). The one-sided system of equation and 
inequalities (21) can be written as 


G@x= 
2. (22) 
H@x<q 
where, 
A= 
G=(Bs),H=| As 
Ba 
(23) 
C= 
p=c> and q=|{cs 
ce 


Recall that 5(G, H, p,q) is the set of solutions for (22). 
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Theorem 9. Let Q(x) = © for some constant vector x = (w,...,«)? € S. Ifz € S™ then 
z € S(G,H, p,q). 


Proof. Let x = (a,...,0)' € S. Suppose Q(z) = ® and z € S™". This implies that f(z) < 
f(x) = a. Therefore we have, Vj € N, z < a. Consequently, z < x and (A ®z); < (A@x); 
for alli € M. Since, Q(z) = ®and z € S(G,H, p,q). 


Corollary 5. If Q(x) = © for some constant vector x € S then S™" C S™"(G,H, p,q). 


Proof. The statement follows from Theorem 9. 


7. Some solvability concepts of a linear system containing of both 
equations and inequalities 


System of max-separable linear equations and inequalities arise frequently in several branches 
of Applied Mathematics: for instance in the description of discrete-event dynamic system 
[1,4] and machine scheduling [10]. However, choosing unsuitable values for the matrix entries 
and right-handside vectors may lead to unsolvable systems. Therefore, methods for restoring 
solvability suggested in the literature could be employed. These methods include modifying 
the input data [11, 26] or dropping some equations [11]. Another possibility is to replace each 
entry by an interval of possible values. In doing so, our question will be shifted to asking 
about weak solvability, strong solvability and control solvability. 


Interval mathematics was championed by Moore [27] as a tool for bounding errors in 
computer programs. The area has now been developed in to a general methodology for 
investigating numerical uncertainty in several problems. System of interval equations and 
inequalities in max-algebra have each been studied in the literature. In [26] weak and strong 
solvability of interval equations were discussed, control sovability, weak control solvability 
and universal solvability have been dealt with in [28]. In [29] a system of linear inequality 
with interval coefficients was discussed. In this section we consider a system consisting of 
interval linear equations and inequalities and present solvability concepts for such system. 


An algebraic structure (B,@,®) with two binary operations © and ® is called max-plus 
algebra if 
B=RU{-o}, a@b=max{a,b}, a@b=a+b 


for any a,b ER. 
Let m,n,r be given positive integers and a € R, we use throughout the paper the notation 
M = {1,2,...,m}, N = {1,2,..,n},R = {1,2,...,r} and a—! = ~a. The set of allmxn,rxn 
matrices over B is denoted by B(m,n) and B(r,n) respectively. The set of all n-dimensional 
vectors is denoted by B(). Then for each matrix A € B(n,m) and vector x € B(n) the product 
A ® x is define as 

(A@x)= pe G + xj) 


For a given matrix interval A = [A, A] with A, A € B(k,n),A < A and given vector interval 
b = [b,b] with b,b € B(n),b < b the notation 


A@x=b (24) 
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represents an interval system of linear max-separable equations of the form 
A®Sx=b (25) 


Similarly, for a given matrix interval C = [A, A] with C,C € B(r,n),C < C and given vector 
interval d = [d,d] with d,d € B(n),b < b the notation 


C&x<d (26) 
represents an interval system of linear max-separable inequalities of the form 
C@ex<d (27) 


Interval system of linear max-separable equations and inequalities have each been studied in 
the literature, for more information the reader is reffered to . The following notation 


A®x=b 


28 
C®x<d a) 


represents an interval system of linear max-separable equations and inequalities of the form 


A®x=b 


29 
Cex<d 


where A € A,CE€C,b €bandd ed. 


The aim of this section is to consider a system consisting of max-separable linear equations and 
inequalities and presents some solvability conditions of such system. Note that it is possible 
to convert equations to inequalities and conversely, but this would result in an increase in 
the number of equations and inequalities or an increase in the number of unknowns thus 
increasing the computational complexity when testing the solvability conditions. Each system 
of the form (29) is said to be a subsystem of (28). An interval system (29) has constant matrices 
if A = A and C = C. Similarly, an interval system has constant right hand side if b = b and 
d = d. In what follows we will consider A € IR(k,n) and C € R(r,n). 


7.1. Weak solvability 


Definition 8. A vector y is a weak solution to an interval system (29) if there exists A € A,C € 
C,b € band d € d such that 


A®y=b 


(30) 
C@y<d 


Theorem 10. A vector x € R” is a weak solution of (29) if and only if 


and 
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Proof. Let i = {1,...,m} be an arbitrary chosen index and x = (x1,%2,..., Xn)? € IR" fixed. If 
A € Athen (A © x); is isotone and we have 


(A@x); € [((A®x);,(A®x)] CR 
Hence, x is a weak solution if and only if 
[(A® x);, (A @ x)j] 9 [bj, bi] (31) 
Similarly, if C@ x < d then x is obviously a weak solution to 


A@x<b 
7 (32) 
C@x<d 


That is 


Also from (31) x is a weak solution if and only if 


[(A® x);, (A @ x)j] N [b;, bi] 4 @, Vi = 1,2,...,m 


That is 


Definition 9. An interval system (29) is weakly solvable if there exists A ¢ A,C € C,beb 
and d € d such that (29) is solvable. 


Theorem 11. An interval system (29) with constant matrix A = A = A, C = C = Cis weakly 
solvable if and only if 


be solvable subsystem for b € [b;,b;]. Then we have 


Ab Ab 
X = > X =b> 
A@s(25)24e2( ) b>b 
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7.2. Strong solvability 


Definition 10. A vector x is a strong solution to an interval system (29) if foreach A € A,C € C 
and each b € b,d € d there is an x € R such that (29) holds. 


Theorem 12. a vector x is a strong solution to (29) if and only if it is a solution to 


E@x=f 
Ca@x<d 


<Q) 


Proof. If x is a strong solution of (29), it obviously satisfies (33). Conversely, suppose x satisfies 
(33) and let A € A,C € Cb € bd € d such that A@x 4 band C@x > d. Thendi € 
(1,2,...,m) such that either (A @ x); < 5; or (A@x); > b; and (C@x); > dj. Therefore, 
(A@®x); < (A@x); < bj, (A@x); > (A@x); > bj and (C@x); > (C@x); > dj and the 
theorem statement follows. 


where 
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1. Introduction 


Multibody dynamics methods are being used extensively to model biomolecular systems 
to study important physical phenomena occurring at different spatial and temporal scales 
[10, 13]. These systems may contain thousands or even millions of degrees of freedom, 
whereas, the size of the time step involved during the simulation is on the order of femto 
seconds. Examples of such problems may include proteins, DNAs, and RNAs. These highly 
complex physical systems are often studied at resolutions ranging from a fully atomistic 
model to coarse-grained molecules, up to a continuum level system [4, 19, 20]. In studying 
these problems, it is often desirable to change the system definition during the course of the 
simulation in order to achieve an optimal combination of accuracy and speed. For example, 
in order to study the overall conformational motion of a bimolecular system, a model based 
on super-atoms (beads) [18, 22] or articulated multi-rigid and/or flexible body [21, 23] can be 
used. Whereas, localized behavior has to be studied using fully atomistic models. In such 
cases, the need for the transition from a fine-scale to a coarse model and vice versa arises. 
Illustrations of a fully atomistic model of a molecule, and its coarse-grained model are shown 
in Fig. (1-a) and Fig. (1-b). 


Given the complexity and nonlinearity of challenging bimolecular systems, it is expected that 
different physical parameters such as dynamic boundary conditions and applied forces will 
have a significant affect on the behavior of the system. It is shown in [16] that time-invariant 
coarse models may provide inadequate or poor results and as such, an adaptive framework 
to model these systems should be considered [14]. Transitions between different system 
models can be achieved by intelligently removing or adding certain degrees of freedom 
(dof). This change occurs instantaneously and as such, may be viewed as model changes 
as a result of impulsively applied constraints. For multi-rigid and flexible body systems, the 
transition from a higher fidelity (fine-scale) model to a lower fidelity model (coarse-scale) 
using divide-and-conquer algorithm (DCA) has been studied previously in [8, 12]. DCA 
efficiently provides the unique states of the system after this transition. In this chapter, we 
focus on the transition from a coarse model to a fine-scale model. When the system is modeled 
in an articulated multi-flexible-body framework, such transitions may be achieved by two 
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(a) Fully atomistic model 


Flexible body representation of a group of atoms 


Kinematic joint 


Rigid body representation of a 


a group of atoms > 


(b) Mixed type multibody model 


Figure 1. Illustration of a biomolecular system. a) Fully atomistic model. b) Coarse grain model with 
different rigid and flexible sub-domains connected to each other via kinematic joints. 


different means. In the first, a fine-scale model is generated by adding flexible dof. This type of 
fine scaling may be necessary in order to capture higher frequency modes. For instance, when 
two molecules bind together, due to the impact, the higher frequency modes of the system 
are excited. The second type of fine scaling transition may be achieved through releasing the 
connecting joints in the multi-flexible-body system. In other words, certain constraints on 
joints are removed to introduce new dof in the model. 


In contrast to those types of dynamic systems in which the model definition is persistent, and 
the total energy of the system is conserved, the class of problems discussed here experiences 
discontinuous changes in the model definition and hence, the energy of the system must 
also change (nominally increase) in a discontinuous fashion. During the coarse graining 
process, based on a predefined metric, one may conclude that naturally existing higher 
modes are less relevant and can be ignored. As such, the kinetic energy associated with 
those modes must be estimated and properly accounted for, when transitioning back to the 
fine-scale model. Moreover, any change in the system model definition is assumed to occur 
as a result of impulsively applied constraints, without the influence of external loads. As 
such, the generalized momentum of the system must also be conserved [6]. In other words, 
the momentum of each differential element projected onto the space of admissible motions 
permitted by the more restrictive model (whether pre- or post-transition) when integrated 
over the entire system must be conserved across the model transition. If the generalized 
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momentum is not conserved during the transition, the results are non-physical, and the new 
initial conditions for the rest of the simulation of the system are invalid. 


In the next section, a brief overview of the DCA and analytical preliminaries necessary to the 
algorithm development are presented. The optimization problem associated with the coarse 
to fine scale transitioning is discussed next. Then the impulse-momentum formulation for 
transitioning from coarse models to fine-scale models in the articulated flexible-body scheme 
is presented. Finally conclusions are made. 


2. Theoretical background 


In this section, a brief introduction of the basic divide-and-conquer algorithm is presented. 
The DCA scheme has been developed for the simulation of general multi-rigid and 
multi-flexible-body systems [5, 8, 9], and systems with discontinuous changes [11, 12]. The 
basic algorithm described here is independent of the type of problem and is presented so that 
the chapter might be more self contained. In other words, it can be used to study the behavior 
of any rigid- and flexible-body system, even if the system undergoes a discontinuous change. 
Some mathematical preliminaries are also presented in this section which are important to the 
development of the algorithm. 


2.1. Basic divide-and-conquer algorithm 


The basic DCA scheme presented in this chapter works in a similar manner described in detail 
in [5, 9]. Consider two representative flexible bodies k and k + 1 connected to each other by a 
joint J’ k as shown in Fig. (2-a). The two points of interest, ist and He, on body k are termed 
handles. A handle is any selected point through which a body interacts with the environment. 
In this chapter, we will limit our attention to each body having two handles, and each handle 
coincides with the joint location on the body, i.e. joint locations J! and JK in case of body 
k. Similarly, for body k + 1, the points | and mrt are located at the joint locations J* 
and J k+1 respectively. Furthermore, large rotations and translations in the flexible bodies are 
modeled as rigid body dof. Elastic deformations in the flexible bodies are modeled through 
the use of modal coordinates and admissible shape functions. 


DCA is implemented using two main processes, hierarchic assembly and disassembly. The 
goal of the assembly process is to find the equations describing the dynamics of each body 
in the hierarchy at its two handles. This process begins at the level of individual bodies and 
adjacent bodies are assembled in a binary tree configuration. Using recursive formulations, 
this process couples the two-handle equations of successive bodies to find the two-handle 
equations of the resulting assembly. For example, body k and body k + 1 are coupled together 
to form the assembly shown in Fig. (2-b). At the end of the assembly process, the two-handle 
equations of the entire system are obtained. 


The hierarchic disassembly process begins with the solution of the two-handle equations 
associated with the primary node of the binary tree. The process works from this node to the 
individual sub-domain nodes of the binary tree to solve for the two-handle equations of the 
constituent subassemblies. This process is repeated until all unknowns (e.g., spatial constraint 
forces, spatial constraint impulses, spatial accelerations, jumps in the spatial velocities) of the 
bodies at the individual sub-domain level of the binary tree are known. The assembly and 
disassembly processes are illustrated in Fig. (3). 
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(b) Assembly k: k +1 


Figure 2. Assembling of the two bodies to form a subassembly. a) Consecutive bodies k andk+1.b) A 
fictitious subassembly formed by coupling bodies k and k + 1. 


Single assembly - Primary system node 


<a 14+2+3+4+5 > 
Resulting subassemblies - Intermediate nodes 


Individual bodies - Leaf or individual body nodes 


ED EDEDEDED 


Figure 3. The hierarchic assembly-disassembly process in DCA. 


HIERARCHIC ASSEMBLY 
ATAWASSVSIC OIHOUVUIH 


2.2. Analytical preliminaries 


For convenience, the superscript c shows that a quantity of interest is associated with the 
coarse model, while f denotes that it is associated with the fine model. For example, the 
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c a 
column matrix ? represents the velocity of handle-1 in the coarse model, and F 


represents the velocity of the same handle in the fine-scale model. In these matrices, v, and 
qi are the spatial velocity vector of handle-1 and the associated generalized modal speeds, 
respectively. 


As discussed previously, the change in the system model definition may occur by changing 
the number of flexible modes used to describe the behavior of flexible bodies, and/or the 
number of dof of the connecting joints. To implement these changes in the system model 
mathematically, the joint free-motion map is defined as follows. 


k 
The joint free-motion map Ph can be interpreted as the 6 x v* matrix of the free-modes of 


motion permitted by the vk degree-of-freedom joint, Jk . In other words, pl maps vk x 1 
generalized speeds associated with relative free motion permitted by the joint into a 6 x 1 
spatial relative velocity vector which may occur across the joint, J k [5]. For instance, consider 
a transition in which a spherical joint in the system is altered, where only one dof is locked 
about the first axis. The joint free-motion maps of the fine and coarse models in this case are 
shown in the following: 


100 00 
010 10 

ee ee ee 

Fx 10.00) #78 — 1.00 (1) 
000 00 
000 00 


We define the orthogonal complement of the joint free-motion map, pe. As such, by definition 
one arises at the following 


k 
(DE)T PL =0 (2) 


3. Optimization problem 


Any violation in the conservation of the generalized momentum of the system in the transition 
between different models leads to non-physical results since the instantaneous switch in the 
system model definition is incurred without the influence of any external load. In other words, 
the momentum of each differential element projected onto the space of the admissible motions 
permitted by the more restrictive model (whether pre- or post-transition) when integrated 
over the entire system must be conserved across the model transition [6]. Jumps in the system 
partial velocities due to the sudden change in the model resolution result in the jumps in the 
generalized speeds corresponding to the new set of degrees of freedom. Since the model is 
instantaneously swapped, the position of the system does not change. Hence, the position 
dependent forces acting on the system do not change, and do not affect the generalized 
speeds. Any change in the applied loads (e.g., damping terms) which might occur due 
to the change in the model definition and the associated velocity jumps do not contribute 
to the impulse-momentum equations which describe the model transition. This is because 
these changes in the applied loads are bounded, and integrated over the infinitesimally short 
transition time. 


Consider a fine-scale model with n dof. Let the dof of the model reduce to n — m after 
the imposition of certain instantaneous constraints. In this case, the conservation of the 
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generalized momentum of the system is expressed as 
Lo ae (3) 


In the above equation, L°/¢ and Lf/¢ represent the momenta of the coarse and fine models, 
respectively, projected on to the space of partial velocity vectors of the coarse model. Equation 
(3) provides a set of n — m equations which are linear in the generalized speeds of the coarse 
model and solvable for the unique and physically meaningful states of the system after the 
transition to the coarser model. 


Now consider the case in which, the coarse model is transitioned back to the fine-scale model. 
Equation (3) is still valid, and provides n — m equations with n unknown generalized speeds 
of the finer model. Furthermore, during the coarsening process, the level of the kinetic 
energy also drops because we chose to ignore certain modes of the system. However, in 
actual biomolecular systems such a decrease in energy does not happen. Consequently, it is 
important to realize the proper kinetic energy when transitioning back to the fine-scale model. 
Therefore, the following equation must be satisfied 


KEf = 5 (uh) Mu! (4) 


In the above equation uf is the n x 1 column matrix containing the generalized speeds of the 
fine model, and M represents the generalized inertia matrix of the fine model. It is clear that 
Eqs. (3) and (4) provide n — m + 1 equations with n unknowns. This indicates that the problem 
is under-determined when multiple dof of the system are released. We may arrive at a unique 
or finite number of solutions, solving the following optimization problem 


Optimize J (uf, t) (5) 
Subjected to @;(uf,t) =0,i=1,---,k (6) 


In the above equation, J is the physics- or knowledge- or mathematics-based objective 
function to be optimized (nominally minimized) subjected to the constraint equations ©;. In 
[1, 15], different objective functions are proposed for coarse to fine-scale transition problems. 
For instance, in order to prevent the generalized speeds of the new fine-scale model from 
deviating greatly from those of the coarse scale model, we may minimize the L? norm of the 
difference between the generalized speeds of the coarse and fine scale models as follows 


J = (uf —u°)T (uf —u°) (7) 


As indicated previously, (n — m) constraint equations governing the optimization problem 
are obtained from the conservation of the generalized momentum of the system within the 
transition. The rest of the constraint equations are obtained from other information about the 
system, such as the specific value of kinetic energy or the temperature of the system. 


The generalized momenta balance equations from Eq. (3) are expressed as 
Au =b (8) 


where A and b are (n — m) x n and n known matrices, respectively, and w/ is ann x 1 column 
matrix of the generalized speeds of the fine-scale system model. As a part of the optimization 
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problem, one must solve this linear system for n — m dependent generalized speeds in terms 
of m independent generalized speeds. Therefore, the optimization is performed on a much 
smaller number (m) variables, with a cost of O(m?). For a complex molecular system, could 
be very large, and n >> m, hence a significant reduction is achieved in the overall cost 
of optimization as compared to other traditional techniques, such as Lagrange multipliers 
[3]. However, the computations required to find the relations between dependent and 
independent generalized speeds can impose a significant burden on these simulations. It is 
shown in [2] that if traditional methods such as Gaussian elimination or LU factorization are 
used to find these relations, this cost tends to be O(n(n — m)*). The DCA scheme provided 
here finds these relations at the end of the hierarchic disassembly process with computational 
complexity of almost O(n) in serial implementation. In other words, in this strategy, DCA 
formulates the impulse-momentum equations of the system which is followed by providing 
the relations between dependent and independent generalized speeds of the system in a 
timely manner. As such, this significantly reduces the costs associated with forming and 
solving the optimization problem in the transitions to the finer models. 


4. DCA-based momenta balance for multi-flexible bodies 


In this section, two-handle impulse-momentum equations of flexible bodies are derived. 
Mathematical modeling of the transition from a coarse model to a fine-scale model is 
discussed. For the fine-scale to coarse-scale model transition in multi-flexible-body system the 
reader is referred to [7, 17]. We will now derive the two-handle impulse-momentum equations 
when flexible degrees of freedom of a flexible body or the joints in the system are released. 
Then, the assembly of two consecutive bodies for which the connecting joint is unlocked is 
discussed. Finally, the hierarchic assembly-disassembly process for the multi-flexible-body 
system is presented. 


4.1. Two-handle impulse-momentum equations in coarse to fine transitions 


Now, we develop the two-handle impulse-emomentum equations for consecutive flexible 
bodies in the transition from a coarse model to a fine-scale model. It is desired to develop the 
handle equations which express the spatial velocity vectors of the handles after the transition 
to the finer model as explicit functions of only newly introduced modal generalized speeds 
of the fine model. For this purpose, we start from the impulse-momentum equation of the 


flexible body as 
Cc tr Cc te 
Tif —Tevg = [78] | Fedt-+ [7] j Foedt (9) 
YF 41 Jt. YF Jo Jt. 


where If and I are the inertia matrices associated with the fine-scale and coarse models, 
respectively. Also, t, and t f Tepresent the time right before and right after the transition. The 


quantities f, ia F,,dt and f, : Fy-dt are the spatial impulsive constraint forces on handle-1 and 
c 


fc 
handle-2 of the flexible body. The matrices Ee and i are the coefficients resulting 
F 2 


YF 
from the generalized constraint force cautabaton at handle-1 and handle-2, respectively. 
Moreover, in Eq. (9), the impulses due to the applied loads are not considered since they 
represent a bounded loads integrated over an infinitesimal time interval. For detailed 
derivation of these quantities the reader is referred to [8]. It is desired to develop the handle 
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equations which provide the spatial velocity vectors of the handles right after the transition to 
the fine-scale model in terms of newly added modal generalized speeds. Therefore, in Eq. (9), 
the inertia matrix of the flexible body is represented by its components corresponding to rigid 
and flexible modes, as well as the coupling terms 


f f Cc rts Cc rts 
ferte Tl - (Lm. ne 
Ter Tre q YF \4 Jt. YF lo Jt 


TR sl ell 
si : 10 
Ee Tre q on) 

which is decomposed to the following relations 
f ‘f a fot 
eee i. Fiedt + 1% / Fedt +TSpof +Teng° — Tego! (11) 
f tf of f f 

Te, = a [ edt + vf f Focdt + TRpvf + Teed’ — Tap gl (12) 


Since the generalized momentum equations are calculated based on the projection onto the 
space of the coarser model, the matrix If is nota square matrix and thus rf rr is not invertible. 
However, we can partition Eq. (11) in terms of dependent (those associated with the coarser 
model) and independent (newly introduced) generalized speeds as 


qi ty ty 
tpi -+ | = 46 Fi -dt 7 Fo-dt 
rf, : rf, | a vn | 1c + f 2c 
Tene reg 10, (13) 
Using the above relation, the expression for the dependent generalized modal speeds is 
written as 
fa)—1 - f 
af = (CYT h [Blt + ia [Feat + Tin 
+ Ferd’ — Thee} —Thp9" (14) 
Defining 
Th =| pf: pf 15 
RE Cae ple (15) 
a rr) elt (16) 
C1 = [ve — Lee nee ie ‘ye (17) 
Co = [Yr - re aes Te oY] (18) 
C3 = [Par Ve (The) Verlog + Pe — Phe (Ee) Teel d® (19) 
G4 = ie ee aes = | (20) 


Aqi = AGi, (i= 1,2,3,4) (21) 
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and using Eggs. (12) and (14), the spatial velocity vector of handle-1 can be written in terms of 
the independent modal speeds 


of =An is Fidt + Ayo i Focdt + Aq3 + Aqagh (22) 
As such, the spatial velocity vector of handle-2 becomes 
oh = (SH )Tol + gh gf (23) 
Employing the same partitioning technique, Eqs. (23) can be written as 
gf 
v = (see + E; : of |: | (24) 
gfi 
= 0h = (SP )T ot + gn glt + nigh (25) 
Using 
Aan = [(S)T An + 65! CEE) 9h — G8! (Che) Teed] (26) 
ag = [(S)T Aga + $4! (Che) fe — oN (fe) Tee Anal (27) 
Aas = [SIM)"Ags + $3! (Pe) Teeof + 05 (Cee) Tied! 
— $f! (Cp) Teena] (28) 
Ang = (SI )T Ara + of! — 8 (CE) — of! CG) Tea] (29) 


and Eq. (25), the spatial velocity vector of handle-2 can be written as 


t t 
oh = | : Fy.dt + A22 / : Fo-dt + Ag3 + AraGii (30) 
Equations (22) and (30) are now in two-handle impulse-momentum form and along with 
Eq. (14), give the new velocities associated with each handle after the transition. These 
equations express the spatial velocity vectors of the handles of the body as well as the modal 
generalized speeds which have not changed within the transition in terms of the newly added 
modal generalized speeds. This important property will be used in the optimization problem 
to provide the states of the system after the transition to the finer models. 


As such, the two-handle equations describing the impulse-momentum of two consecutive 
bodies, body k and body k + 1 are expressed as 


t t 

A= ap [Aa rap [Para al ralQa e 
t t 

am Ald [rian AD fan a + alta 62 


t t 
okt DF — ak) 7 eggs oD i apa YE AGD g(e+fi (33) 
t 


c c 


ce 4 ff ay ee tp ake 
DTUF lett) [ Fa 4 ak = BEM ae 4 AG) 4 ASD eh (aay 
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4.2. Assembly process and releasing the joint between two consecutive bodies 


In this section, a method to combine the two-handle equations of individual flexible bodies 
to form the equations of the resulting assembly is presented. Herein, the assembly process 
of the consecutive bodies is discussed only within the transition from a coarse model to 
a finer model. This transition is achieved by releasing the joint between two consecutive 


bodies. Clearly, this would mean a change in the joint free-motion map pl and its orthogonal 


k 
complement De . It will become evident that the assembly process of the consecutive bodies 
for the fine to coarse transition is similar, and the associated equations can be easily derived 
by following the given procedure. 


From the definition of joint free-motion map, the relative spatial velocity vector at the joint 
between two consecutive bodies is expressed by the following kinematic constraint 


oF — of = pif yk/k+1)f (35) 


In the above equation, ulk/K+DF is the relative generalized speed defined at the joint of the 
fine model. From Newton’s third law of motion, the impulses at the intermediate joint are 


related by 
ff tf 
i FM at = — d: FRM at (36) 
Substituting Eqs. (32), (33) and (36) into Eq. (35) results in 


£ t t 
k+1) k) f (k+1) k) [i ak k+1) [f .(k+1) 
A 4 AW) [eee a AW [Bara ("ea 


c 


+ A) — Ae) 4 AW gos — y 


k 
(k (e+) a(ke) fi sts Pht (h/t f (37) 


Using the definition of the joint free-motion map, the spatial constraint impulses lie exactly 
in the space spanned by the orthogonal complement of joint free-motion map of the coarser 
model. These constraint impulses can be expressed as 


fh (kt ke ff (ke 
FAD dt = Die | FE) ae (38) 


k+1) 


dt is an ordered measure number of the impulsive constraint 


In the above equation, /; : F\ 


torques and forces. Pre-multiplying Eq. (37) by (D), °)", one arrives at the expression for 
fe FAD at as 


tr ty tf 
(k+1) 3, (k) (k) (k+1) (k+1) 
| FD ge = xalk | Far — xalk | FD at 


c 


+ XY 4 XAM GMA — XD DA 4 xplf ylK/eHD)f (39) 

where 
X= DR (DR) (Aft? + Asp Dg] (DR) (40) 
Y= Ag) AyD (41) 
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Using Eqs. (31), (34), and (39), we write the two-handle equations for the assembly k: k +1 


pke+f _ wp hk [ ptt dt 4 ve) a pct Fr 
a python 04 + ape) g(t fi ED yk F (40) 

oleRt DE z yet [ ztk dt 4 weet) [ prt dt 
er python 0 + ape g(t fi ED yk F (43) 

where: 

pd) — A) _ x, @) (44) 
yeaa ea (45) 
wD) — 4 _ AW xy (46) 
pled) — 4) _ Wx, @) (47) 
PED @ AW x,y) (48) 
piektd) _ _ A yphf (49) 
ist = 60) 
wif = ghey sp 
ce eer aL ae sale (52) 
PD — AD (53) 
wet = Ate Ab ae ep 
pSeked) _ ye) ypl'f (55) 


The two-handle equations of the resultant assembly express the spatial velocity vectors of 
the terminal handles of the assembly in terms of the spatial constraint impulses on the same 
handles, as well as the newly added modal generalized speeds of each constituent flexible 
body, and the newly introduced dof at the connecting joint. These are the equations which 
address the dynamics of the assembly when both types of transitions occur simultaneously. In 
other words, they are applicable when new flexible modes are added to the flexible constituent 
subassemblies and new degrees of freedom are released at the connecting joint. If there is 
no change in the joint free-motion map, the spatial partial velocity vector associated with 
ulk/k+DF does not appear in the handle equations of the resulting assembly. 


5. Hierarchic assembly-disassembly 


The DCA is implemented in two main passes: assembly and disassembly [8, 9]. As mentioned 
previously, two consecutive bodies can be combined together to recursively form the handle 
equations of the resulting assembly. As such, the assembly process starts at the individual 
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sub-domain level of the binary tree to combine the adjacent bodies and form the equations 
of motion of the resulting assembly. This process is recursively implemented as that of the 
binary tree to find the impulse-momentum equations of the new assemblies. In this process, 
the spatial velocity vector (after transition) and impulsive load of the handles at the common 
joint of the consecutive bodies are eliminated. The handle equations of the resulting assembly 
are expressed in terms of the constraint impulses and spatial velocities of the terminal handles, 
as well as the newly introduce modal generalized speeds and generalized speeds associated 
with the newly added degrees of freedom at the connecting joints. This process stops at the 
top level of the binary tree in which the impulse-momentum equations of the entire system 
are expressed by the following two-handle equations 


: fr : tf : 
of — yim / Flat +9") i; Fat + ¥e” 
t 


c c 


+ yy" gens + ln) (nf (56) 


. tf . tr ; 
of = yn ) Fiat +49") / Frat + ¥” 
te t 


c 


a pln gins + el") yen) f (57) 


(1:n) 
ij 
are linear in terms of the newly added generalized modal speeds as well as the generalized 
speeds associated with the released dof at the joints of the system. 


Note that through the partial velocity vectors ¥ ,(i = 1,2andj = 4,5), these equations 


The two-handle equations for the assembly at the primary node is solvable by imposing 
appropriate boundary conditions. Solving for the unknowns of the terminal handles initiates 
the disassembly process [1, 11]. In this process, the known quantities of the terminal handles 
of each assembly are used to solve for the spatial velocities and the impulsive loads at the 
common joint of the constituent subassemblies using the handle equations of each individual 
subassembly. This process is repeated in a hierarchic disassembly of the binary tree where 
the known boundary conditions are used to solve the impulse-momentum equations of the 
subassemblies, until the spatial velocities of the fine model and impulses on all bodies in the 
system are determined as a known linear function of the newly introduced generalized speeds 
of the fine model. 


6. Conclusion 


The method presented in this chapter is able to efficiently simulate discontinuous changes in 
the model definitions for articulated multi-flexible-body systems. The impulse-momentum 
equations govern the dynamics of the transitions when the number of deformable modes 
changes and the joints in the system are locked or released. The method is implemented 
in a divide-and-conquer scheme which provides linear and logarithmic complexity when 
implemented in serial and parallel, respectively. Moreover, the transition from a coarse-scale 
to a fine-scale model is treated as an optimization problem to arrive at a finite number of 
solutions or even a unique one. The divide-and-conquer algorithm is able to efficiently 
produce equations to express the generalized speeds of the system after the transition to the 
finer models in terms of the newly added generalized speeds. This allows the reduction in 
computational expenses associated with forming and solving the optimization problem. 
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